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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicants : MANFRED T. REETZ ET AL. 
International Serial No: PCT/EP98/04612 

Serial No. : TO BE ASSIGNED 

Filed : HEREWITH 

For : METHOD FOR PRODUCING AND IDENTIFYING NEW 

HYDROLASES HAVING IMPROVED PROPERTIES 

Art Unit : TO BE ASSIGNED 

Examiner : TO BE ASSIGNED 

January 24, 2000 

Hon. Assistant Commissioner 

for Patents 
Washington, D. C. 20231 

PRELIMINARY AMENDMENT 

Sir: 

Prior to examination, please amend the above-identified application as follows: 
IN THE CLAIMS: 

Cancel the original claims and substitute: 
' A: A process for the preparation and identification of hydrolase mutants having 
improved properties with respect to stereo- or regioselectivity, characterized in that 

a) a starting hydrolase gene is mutagenized by a modified polymerase chain reaction 
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(PCR), wherein the mutation rate and total number of mutations in the amplified DNA is 
adjusted by adjusting the concentrations of Mg 2+ , Mn 2+ and of the deoxynucleotides and by 
adjusting the number of cycles; 

b) optionally one or more hydrolase genes mutated according to step a), or 
mixtures of one or more starting hydrolase genes and one or more hydrolase genes 
mutated according to step a) are mutagenized by enzymatically fragmenting said genes, 
followed by enzymatic reassembly of the fragments produced to give complete 
recombinant hydrolase genes; 

c) the mutated hydrolase genes obtained according to step a) or b) are 
transformed into a host organism; and 

d) hydrolase mutants having improved properties, expressed by transformants 
obtained in step c), are identified by a test method. 



^7 The process according to claim 4, wherein an average mutation rate of 1 -2 
base substitutions, per one hydrolase gene to be mutagenized, is adjusted in the PCR in 
step a) by adjusting the concentrations of Mg 2+ , Mn 2+ and of the deoxynucleotides. 
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3? 

-3T The process according to claim 4 ? wherein a hydrolase gene mutagenized 
in a PCR previously performed according to claim 4 is used as the starting hydrolase gene 
in step a). 



REMARKS 

The foregoing amendment place the claims in better form for U.S. examination. 



Early and favorable action is earnestly solicited. 
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Methodfor Producing and Ident ifying New 
Hydrolases Having Improved Properties 

The present invention relates to a process for the preparation and identifi- 
cation of hydrolase mutants having improved properties with respect to 
stereo- or regioselectivity, catalytic activity or stability in chemical reac- 
tions. 

Prior art: 

Hydrolases are among the most wide-spread enzymes in organic synthesis. 
As a subgroup of the hydrolases, esterases and lipases, in particular, 
catalyze a wide variety of reactions, such as the hydrolysis of carboxylic 
acid esters, or the synthesis of esters or transesterifications in organic 
solvents. Due to their high stereoselectivity, stability and their being readily 
available, they are interesting for numerous industrial processes. Thus, for 
example, lipases have been industrially employed for the optical resolution 
of chiral alcohols, acids or amines, for the preparation of optically pure 
medicaments, natural substances, plant protective agents or high-grade 
fats and oils (K. Faber, Biotransformations in Organic Chemistry, Springer- 
Verlag, Berlin, 2nd Ed. 1995). Nevertheless, the enantioselectivity of a 
lipase or esterase with respect to a given substrate cannot be predicted 
with certainty, and in many cases, the reactions proceed with only 
moderate optical yields. Therefore, there is a need for a process for the 
preparation of hydrolases which enables a well-aimed optimization of 
enantioselectivity with respect to a desired product and the special 
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process conditions, such as temperature and solvent. Although effects on 
the enantioselectivity of lipases could be studied using the molecular- 
biological method of in vitro mutagenesis, which is customary today (K. 
Hult, M. Holmquist, M. Martinelle, European Symposium on Biocatalysis, 
Graz, 1993, Abstracts, L-4), an optimization with respect to a particular 
substrate which would have led to an enzyme useful in organic synthesis 
could not be achieved. 

The most important possible applications of genetic engineering include 
protein desig, wherein mutations are introduced base-specifically into the 
gene sequence of the corresponding protein based on known structural 
data using in vitro mutagenesis. By selectively substituting amino acids, 
enzymes having improved catalytical activity or stability could already be 
prepared in this way (A. Shaw, R. Bott, Current Opinion in Structural 
Biology, 1996, 6, 546). This technique, the so-called oligonucleotide- 
directed or site-directed mutagenesis, is based on the substitution of a 
short sequence segment of the gene coding for the naturally occurring 
enzyme (wild type) by a synthetically mutagenized oligonucleotide. 
Subsequent expression of the gene results in an enzyme mutant which 
may have advantageous properties. In a method derived therefrom, the 
so-called cassette mutagenesis, oligonucleotides with partially randomized 
sequences are used. This provides a library of mutants of a limited size, 
which can then be tested with respect to its properties. 

Despite of the advantages of these established methods, they are hardly 
suitable for the stepwise optimization of an enzyme or for the generation 
of enzymes having novel properties. The fact that our understanding of 
the laws governing protein folding and the structure-function relationship 
of proteins is still incomplete is the main reason for the failing of many 
projects in the field of the so-called rational protein design. In addition, a 
stepwise optimization process according to the classical method is rela- 
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tively labor-consuming and does not ensure a significant improvement of 
the enzyme properties per se. 

More recently, novel molecular-biological methods of mutagenesis have 
been described (D.W. Leung, E. Chen, D.V. Goeddel, Technique, 1989, 1, 
11, and W.P.C. Stemmer, A. Crameri, PCT WO 95/22625) which are 
based on the polymerase chain reaction known from the literature (R.K. 
Saiki, S.J. Scharf, F. Faloona, K.B. Mullis, G.T. Horn, H.A. Erlich, N. 
Arnheim, Science, 1985, 230, 1350). Instead of site-directed mutagene- 
sis, these methods employ combinatorial methods for the generation of 
extensive mutant libraries which are subsequently screened for mutants 
having positive properties using suitable screening methods. This mimics 
the naturally occurring evolutive processes of replication and recombina- 
tion, mutation and selection on a molecular level. This method, described 
as in vitro evolution (or directed evolution), has already proven useful in 
some cases as a suitable method for obtaining new biocatalysts (W.P.C. 
Stemmer, Nature, 1994, 370, 389, and F.H. Arnold, Chemical Engineering 
Science, 1996, 51, 5091). 

In spite of the progress made in this field, this method cannot yet be 
generally transferred to all classes of enzymes, since suitable test meth- 
ods for identifying mutants with positive properties are lacking in most 
cases. Such methods are a sine qua non, however, in view of the large 
number of mutated enzyme variants to be expected in the production of 
combinatorial mutant libraries. Especially in the case of the lipases which 
are interesting for industrial processes, the production of mutants with 
improved stereoselectivity by the methods of in vitro evolution has not 
been successful to date, because an efficient screening method for 
enantioselectivity testing still does not exist. The classical method for 
determining the enantioselectivity of a lipase- or esterase-catalyzed 
reaction is based on the separation of the reaction products and educts by 
liquid or gas chromatography using chirally modified stationary phases. 
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However, due to the enormous number of samples to be processed in the 
screening of extensive mutant libraries, this method is unsuitable since 
chromatographical separations with chirally modified columns are time- 
consuming, being only capable of sequential processing. Another as yet 
unsolved problem is the difficulty, frequently to observe, of expressing 
functional lipases or esterases in host organisms with a sufficiently high 
activity yield. However, this is indispensable to a high-performance 
screening system since too low enzyme activities are difficult to detect in 
the determination of enantioselectivity due to the limited sensitivity of a 
test system. 

Object of the invention: 

Therefore, it has been the object of the present invention to provide a 
simple process for the preparation of mutated hydrolases, especially 
lipases or esterases, having improved stereo- or regioselectivity, catalytic 
activity and stability towards particular substrates (e.g., carboxylic acids, 
alcohols, amines, or their derivatives), which process additionally enables 
a rapid identification of positive mutants from extensive mutant libraries, 
and the use of the enzymes thus prepared in the optical resolution of 
chiral alcohols, acids and amines, and their derivatives. 

Description of the invention: 

As a rule, the preparation of the new biocatalysts starts with the isolation 
of a lipase or esterase gene from the organism of origin. This may be any 
microbial, plant and animal organism which is the carrier of a lipase or 
esterase gene. The isolation of the gene can be effected according to the 
methods known from the literature (J. Sambrook, E.F. Fritsch, T. Maniatis, 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory 
Press, 1989, New York). Usually, the genomic DNA is fragmented using 
restriction endonucleases, and the gene fragments obtained are cloned in 
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a host organism (e.g., E. coli). Then, using oligonucleotides with sequence 
homology to a segment of the lipase or esterase gene, the gene is 
identified within the gene library in hybridization experiments, followed by 
isolation thereof. 

Surprisingly, it has been found according to the invention that naturally 
occurring hydrolase genes can be mutagenized by a modified polymerase 
chain reaction (PCR), changing certain reaction parameters, to obtain an 
extensive mutant library which can be screened for mutants having 
improved enantioselectivity using a novel test method. 

The novelty of the process resides in that an extensive randomized 
mutant library can be established, starting with a naturally occurring 
lipase or esterase gene (the so-called wild type gene), using a modified 
PCR (hereinafter referred to as mutagenizing PCR). It has been found that 
the mutation rate during the PCR can be adjusted in a well-aimed manner 
by changing the components of the PCR. The number of mutations in the 
lipase gene in question (the mutation rate) can be controlled by varying 
the concentrations of Mg 2+ and/or of the deoxyoligonucleotides and/or the 
addition of Mn 2+ ions. Preferably, the following concentrations are used 
depending on the DNA polymerase employed: 

Mg 2+ : 1.5 mM - 8.0 mM 
dNTP : 0.05 mM - 1.0 mM 
Mn 2+ : 0.0 mM - 3.0 mM 

In addition, it has been found that the number of cycles in the PCR 
correlates with the number of mutations: the higher the selected number 
of cycles, the higher is the total number of mutations. By means of this 
parameter, the diversity of the mutant library can be adjusted. 
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For determining the mutation rate, the purified PCR products are se- 
quenced. The mutation rate can be determined by comparing the se- 
quences obtained with the sequence of the wild type gene. 

Table 1 shows the mutation rate as a function of the concentration of the 
above mentioned components of the PCR in the amplification of the lipase 
gene from P. aeruginosa (lip A). 



Table 1 



Exp. 



2+ 



Mg 
(mM) 



2+ 



Mn 
(mM) 



dATP/ 
dGTP 
(mM) 



dTTP/ 
dCTP 
(mM) 



Mutation rate 
(mutations/ 
1000 bp 1} ) 



1 
2 



6.1 

7.0 



0.5 



0.2 
0.2 



0.2 
1.0 



i) 



bp = base pairs 



1-2 

15-20 



From the sequencing results, it can further be seen that the transition and 
transversion types of mutation occur in about the same statistical fre- 
quency. In contrast, deletions and insertions are rarely observed. In 
addition, the mutations are uniformly distributed over the entire lipase 
gene. Thus, a mutant library with statistically uniformly distributed 
mutations can be produced by the method described. A mutation rate of 
1-2 mutations/hydrolase gene has proven advantageous. Thereby, it is 
prevented that a negative mutation will mask a mutation with a positive 
effect, as would be the case if several mutations occurred per one hydro- 
lase gene. In order to obtain a complete mutant library, each with one 
amino acid substitution per enzyme molecule, 5415 mutants must theo- 
retically be generated in a lipase consisting of 285 amino acids (here: 
lipase from P. aeruginosa). This value results from the following formula: 



N = 19 x M x 285! / [(285 - M)! x M!] 
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with N = number of mutants, and M = number of amino acid substitutions 
per one lipase molecule. According to the invention, it could be surpris- 
ingly shown that positive mutants are found in even substantially smaller 
sized libraries, a mutation rate of 1-2 having been employed. 

The mutated lipase or esterase genes obtained by the process described 
are ligated into a suitable expression vector and then transformed into a 
host organism, e.g., E. coli. Then, the transformed cells are plated on 
agar plates and cultured. If the expression rate is sufficiently high, the 
colonies obtained can be transferred to microtitration plates provided with 
a liquid medium and, after growth has started, can be directly employed 
in a screening test. In the case where only little enzyme is formed in the 
expression of the lipase gene or the gene product is not correctly folded in 
the host organism used (inclusion bodies) or incompletely secreted into 
the culture medium, it will be advantageous to redone the mutated genes 
in another host organism, preferably the original organism. 

In order to obtain sufficiently high enzyme activities, the individual 
bacterial clones which contain a mutated lipase or esterase gene are 
transferred from the agar plates into the wells of commercially available 
microtitration plates and cultured in liquid medium. Preferably, microtitra- 
tion plates having 96 wells per plate are employed. The growth of the 
bacteria can be monitored by measuring the cell density (OD600 value). It 
is advantageous to inoculate a second microtitration plate in parallel in 
this way in order to have a reference for the later identification of positive 
clones. After the growth of the bacteria glycerol is conveniently added to 
the reference plate, which is then stored at -80 °C until used for identifi- 
cation. If the bacteria are secreting the enzyme into the extracellular 
space (as with the lipase from P. aeruginosa), the cells in the microtitra- 
tion plates are centrifuged off, and the supernatant with the lipase or 
esterase activity is used for the screening test. In the case where the 
bacteria (e.g., E. coli) accumulate the enzyme in the periplasm, a cell wall 
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lysis must be preliminarily done, wherein methods known from the 
literature, such as lysozyme treatment, can be used. 

By culturing the corresponding clones from the reference plate, sufficient 
plasmid DNA can be isolated which can be used for the characterization of 
the mutated lipase or esterase gene. The mutations are localized within 
the gene by sequencing. One advantage of the invention is the fact that 
the mutated gene in a positive clone can be further optimized with respect 
to its properties in further mutation cycles by the process described, even 
without knowing the exact position of the mutations. Thus, the isolated 
lipase or esterase gene is again used in a PCR modified according to the 
above stated conditions (mutagenizing PCR). This procedure may be 
repeated until the properties of the lipase or esterase mutant meet the 
requirements of the stereoselective reaction. 

For a further optimization of the identified positive mutants, the process 
described can be extensive in that the DNA of several positive mutants is 
first fragmented and then can be reassembled into functional lipase or 
esterase genes in a combinatorial process according to W.P.C. Stemmer 
{Nature, 1994, 370, 389). The thus obtained in vitro recombinant library 
is subsequently expressed, and the recombinant gene products are 
examined for improved enantioselectivity using the test methods accord- 
ing to the invention. The advantage of this method is that the positive 
properties of different lipase or esterase mutants may be added in one 
new recombinant gene due to the recombination, which eventually may 
result in a further improvement of the lipase or esterase. The course of 
the method described is as follows: 

Using the enzyme DNase I (e.g., from bovine pancreas), the lipase or 
esterase genes are first cleaved into fragments having a preferably length 
of between 25 bp and 100 bp. The size of the fragments can be checked 
by separating them by means of agarose electrophoresis and comparing 
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with corresponding DNA length markers. The DNA fragments thus ob- 
tained are purified to free them from adhering DNase. The in vitro recom- 
bination is performed under the conditions of a conventional PCR, but 
without adding any PCR primers. In analogy with conventional PCR, one 
cycle is comprised of three steps: a) denaturing, b) annealing and c) 
elongation. During annealing, hybridization occurs of sequence-homo- 
logous fragments which may be derived from different mutated lipase or 
esterase genes. In the subsequent elongation step, the strands are 
completed by the DNA polymerase so that new recombinant lipase genes 
are eventually obtained. The optimum number of cycles is determined in a 
preliminary experiment. Thus, after every 5 cycles, a small sample of the 
reaction mixture is separated by agarose gel electrophoresis to determine 
the cycle in which the maximum of the size distribution of the recombi- 
nants in the range of the size of the enzyme gene. A number of cycles of 
between 30 and 45 is preferably selected. The band obtained in the 
agarose gel which corresponds in size to the lipase or esterase gene is 
purified and amplified by a conventional PCR. The PCR product is purified 
and, following ligation into a suitable vector (plasmid), transformed into E. 
coli. As already discussed in the paragraph dealing with mutagenizing 
PCR, it may be required to redone in another host organism if the lipase 
activity should be too low after expression in E. coli. The recombinants 
obtained are grown in microtitration plates for the test for enantioselec- 
tivity. 

In a variant of the invention, the described methods of mutagenizing PCR 
and in vitro recombination for the production of mutant or recombinant 
libraries can be performed successively or repeated in any order and 
frequency desired in order to optimize the enantioselectivity of the lipase 
or esterase. Preferably, at least one mutation cycle is performed in the 
beginning using mutagenizing PCR. This may then be followed by an in 
vitro recombination cycle, wherein the best positive mutant clones are 
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respectively employed. By monitoring the enantioselectivity of the enzyme 
mutants obtained, the optimization process can be followed. 

In another variant of the invention, positive lipase or esterase mutants 
identified by the screening of mutant or recombinant libraries can be 
further optimized using classical directed mutagenesis or cassette muta- 
genesis. Thus, the mutation in the lipase or esterase gene is first localized 
by sequencing. This gene is subsequently again mutagenized by means of 
"wobbled" primers at the codons coding for positive mutants. The thus 
obtained mutant library of a limited size can then be expressed and 
screened for improved enantioselectivity. 

Positive lipase or esterase mutants identified by the screening of mutant 
or recombinant libraries can be further optimized using site-directed 
saturation mutagenesis. Thus, the positive mutation in the lipase or 
esterase gene is first localized by sequencing. Then, using any method of 
site-directed mutagenesis which allows for the exchange of multiple 
bases, this gene is changed in such a way that all possible codons are 
formed at the site of the gene which codes for the position to be opti- 
mized. This provides a library of mutants of a limited size in which mu- 
tants the amino acid originally present in the amino acid position to be 
optimized has been replaced by the remaining 19 amino acids. The thus 
obtained mutant library of a limited size can then be expressed and 
screened for improved enantioselectivity. 

In a variant of the method described, the lipase or esterase gene of the 
wild type enzyme is employed for in vitro recombination together with the 
positive mutants found. This can result in backcrossings in which muta- 
tions having neutral or negative properties can be eliminated. Following 
expression, the recombinant library obtained can be examined for im- 
proved enantioselectivity. 
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In another variant of the method described, hydrolase genes from differ- 
ent organisms are employed for in vitro recombination, provided they 
possess sufficient sequence homology with the originally employed 
hydrolase gene. 

In a variant of the method, the in vitro recombination is performed under 
the conditions of the modified PCR described. Thus, the concentrations of 
the Mg 2+ or Mn 2+ ions and of the deoxynucleotides (dNTPs) are changed 
to adjust the mutation rate during in vitro recombination in a well-aimed 
manner. 

The invention further relates to test methods which allow for the identifi- 
cation of enzyme mutants having improved stereoselectivity or regiose- 
lectivity from extensive mutant libraries. Thus, after centrifuging off the 
bacterial cells, two aliquots of the enzyme-containing supernatant are 
transferred to adjacent wells of a new microtitration plate. After addition 
of the two enantiomeric pure substrates in the two wells, respectively, the 
activity of the lipase or esterase is determined by spectrophotometry. The 
measurements are performed in a commercially available spectral pho- 
tometer for microtitration plates. This allows for a high sample through- 
put. The selection of the substrate depends on the type of chiral com- 
pound for which optimization of the lipase or esterase is to be effected. 
The method is particularly suitable for chiral carboxylic acids, alcohols and 
amines. 

In the case of chiral carboxylic acids or chiral COOH-functional com- 
pounds, the two corresponding p-nitrophenyl esters of the (R)- and (S)- 
acids are employed as test substrates. Formula 1 shows the principle of 
the test method wherein R represents any organic residue having at least 
one asymmetric center. 
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Formula 1 

Scheme of the test method for stereoselectivity for chiral carboxylic acids 
or COOH-functional compounds 

Reaction 1: 




(R)-Enantiomer 



Reaction 2: 

II I Lipase ii I 

R X 0 XJ — %A OH \ o A^ •"• 

(S)-Enantiomer 

Due to the high absorbance of the p-nitrophenolate anion released in the 
hydrolase-catalyzed ester hydrolysis (W = 405 nm, Emax = 14,000), a 
highly sensitive test method results by which an activity determination 
can be performed even for low substrate concentrations. The enantiose- 
lectivity of the hydrolase mutants can be determined with sufficient 
accuracy from the quotient of the hydrolysis rates V apP (R) and Va PP (s) for 
the (R)- and the fS>ester, respectively. Since both test reactions contain 
only one enantiomer (either the R- or the S-ester), the absence of a 
competing reaction with the other enantiomer must be taken into account 
when the enantioselectivity is determined. Although this kinetic effect may 
lead to the calculation of inaccurate enantioselectivities, it has been found 
that the apparent enantioselectivities obtained by the presented method 
(Eapp) are sufficiently telling with respect to the enantioselectivity of the 
mutated lipases. E apP is obtained as V a pp(R)/Va PP (s). Another advantage is 
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the simple performance and good reproducibility of the test, which is also 
suitable for screening with a high sample throughput. 

In the case of chiral alcohols or chiral OH-functional compounds, fatty acid 
esters of the two optically pure alcohols are employed in the test for 
stereoselectivity. The chain length of the fatty acids is within a range of 
from C2 to Ci8. As the alcohol component, primary, secondary and tertiary 
alcohols and their derivatives having at least one asymmetric center can 
be used. Solutions of the esters of the (R)- and (S>alcohols are hydro- 
lysed with culture supernatants of the hydrolase mutants in adjacent wells 
of a microtitration plate. The hydrolysis rates V apP (R) and V apP (s) for the 
(R)- and the (S>ester, respectively, are a measure of the enantioselec- 
tivity of the enzyme mutant examined. Detection is effected through a 
coupled enzyme reaction (H.U. Bergmeyer, Grundlagen der enzymati- 
schen Analyse, Verlag Chemie, Weinheim, 1977) in which the continuous 
release of the fatty acid is monitored. The dye produced is assayed by 
colorimetry at 546 nm (s = 19.3 l-mmol^-cm" 1 ). The concentrations of the 
enzymes, cofactors and coenzymes of auxiliary reactions 2 and 3 (see 
Formula 2) and of the indicator reaction 4 must be selected in such a way 
that the lipase- or esterase-catalyzed reaction to be determined is rate- 
determining. The quotient of the hydrolysis rates for the (R)- and the (S)- 
ester, respectively, corresponds to the apparent enantioselectivity (E apP ). 
In one variant, the fatty acid amides of chiral amines or NH2- or IMHR- 
functional compounds are employed instead of the optically pure esters. 
Formula 2 shows the scheme of the test system. 
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Formula 2 

Scheme of the test method for stereoselectivity for chiral alcohols; R 
represents any organic residue having at least one asymmetric center; 
abbreviations: CoA (coenzyme A), ATP (adenosine-5'-triphosphate), AMP 
(adenosine-5'-monophosphate) 




free fatty acid 



2. ) free fatty acid + CoA + ATP Acvl-CoA Synthetase^ acyl-CoA + AMP + pyrophosphate 

Acyl-CoA Oxidase 

3 . ) acyl-CoA + 0 2 enoyl-CoA + H 2 0 2 

4. ) H 2 0 2 + 4-aminoantipyrine + 2,4,6-tribromo-3-hydroxybenzoic acid , Peroxidas % red dye + 

2 H 2 0 + HBr 

In a variant of the method, the corresponding esters and amides of 
succinic acid can be employed instead of the fatty acid esters or amides. 
The latter have the advantage, over the fatty acids, of being more soluble 
in aqueous solutions or aqueous-organic solvents. The measurement is 
performed by UV spectrometry at 340 nm (e = 6.3 l-mmo^-cm" 1 ). In this 
test method too, it has to be taken care that the hydrolase-catalyzed 
reaction 1 be rate-determining. The quotient of the hydrolysis rates V apP (R) 
and Vapp(s) for the (R)- and the (Sj-ester, respectively, corresponds to the 
apparent enantioselectivity (E apP ). In one variant, the fatty acid amides of 
chiral amines are employed instead of the optically pure esters. Both 
primary and secondary amines may be employed as the amine compo- 
nent. The scheme of the test system is represented in Formula 3. 
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Formula 3 



Scheme of the test method for stereoselectivity for chiral alcohols; R 
represents any organic residue having at least one asymmetric center; 
abbreviations: CoA (coenzyme A), ITP (inosine-5'-triphosphate), IDP (in- 
osine-5'-diphosphate), NADH/NAD + (reduced/oxidized nicotinamide aden- 
ine dinucleotide) 



« . HOOC. ^\ Lipase or esterase 

1.) XOOR +H 2 0 ' ^^^COOH +R -° H 

free succinic acid 



Succinyl-CoA synthetase 
2. ) Succinic acid + ITP + CoA — ► IDP + succinyl-CoA + phosphate 



Pyruvate kinase 

3.) IDP + phosphoenol pyruvate *~ ITP + pyruvate 



Lactate dehydrogenase 
4 . ) Pyruvate + NADH + H + ' - L-lactate +NAD + 



The test for the identification of hydrolase mutants having improved 
stereoselectivity may further be performed in such a way that both 
stereoisomers are contained in the test reaction. Thus, the separated 
measurements of the (R)- and fS>enantiomers can be dispensed with. 
The test principle starts with binding a racemic mixture of the chiral 
substrate to a solid phase. Through an ester or amide linkage to this 
chiral compound, a radioactively labeled organic residue is bound. Two 
cases can be distinguished: 

a) Solid-phase bound chiral carboxylic acid: the carboxy function is 
esterified with a radioactively labeled alcohol. 
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b) Solid-phase bound chiral alcohol or chiral amine, or OH- or Non- 
functional (or NHR-functional) compounds: the hydroxy or amine 
function is labeled with a radioactively labeled carboxylic acid. 

It is critical that the two enantiomers of the racemic mixture bound to the 
solid phase be labeled with different isotopes. Preferably, 3 H- and Re- 
labeled compounds are used. As the solid phase, all usual organic func- 
tionalized polymers as well as inorganic functionalized supports can be 
employed. Preferably, solid phases based on polystyrene and silica gel 
supports are employed. The chiral radioactively labeled compounds are 
then bound to the solid phase wherein the coupling to the solid phase 
must be adapted to the chemical nature of the chiral substrate. Formula 4 
shows the scheme of the modified solid phase and the principle of the test 
method. 
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Formula 4 

Scheme of the solid-phase screening test for stereoselectivity with a dual 
radioactively labeled substrate; X = 0, NH; R is a radioactively labeled 
organic residue 




Approximately equal amounts of the thus modified support can be dis- 
pensed to small reaction vessels (e.g., the wells of microtitration plates) 
and then admixed with the culture supernatants of the hydrolase mutants. 
In the subsequent reaction, the radioactively labeled components (car- 
boxylic acid or alcohol) are hydrolysed from the solid phase and released 
into the liquid medium. An aliquote of the medium is then removed and 
examined for the amount of radioactivity in a scintillation counter. From 
the ratio between the two different isotopes, the enantiomeric excess and 
the conversion of the reaction and thus the enantioselectivity of the 
mutated esterase or lipase can be calculated. By using regioisomeric test 
compounds, the tests described can also be used for the identification of 
hydrolase mutants having improved regioselectivity. Instead of hydrolase 
mutants, other catalysts may also be employed for determining the 
stereo- or regioselectivity. 

The test for enantioselectivity of the hydrolase mutants prepared by the 
process described may also be performed by a capillary-electrophoretical 
separation using chirally modified capillaries which allow for a direct 
separation of the enantiomeric substrates or products of the hydrolase- 
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catalyzed test reaction. Here, the test substrates can be employed as a 
racemate. The separation may be effected both in capillaries and by the 
use of prepared microchips which allow for electrophoretical separation 
and parallel running of the analyses for a high sample throughput. In both 
cases, it is a precondition that the enantiomers can be separated by 
capillary electrophoresis. 

The invention will now be further illustrated by the following Examples and 
Figures. 

Figure 1 shows the experimentally obtained measured curves for the 
determination of the apparent enantioselectivity (Eapp) in the hydrolysis of 
(R)- and (S>2-methyldecanoic acid p-nitrophenyl ester with culture 
supernatants of the lipase mutants P1B 01-E4, P2B 08-H3, P3B 13-D10, 
P4B 04-H3, P5B 14-C11, P4BSF 03-G10, and the wild type lipase from P. 
aeruginosa (the slopes have the unit [mOD/min]). 

Figure 2: Comparison of the DNA sequences of the lipase mutants P1B 
01-H1, P1B 01-E4, P2B 08-H3, P3B 13-D10, P4B 04-H3, P5B 14-C11 and 
P4BSF 03-G10 S155F with the sequence of the wild type of lipase from P. 
aeruginosa (the mutated bases with respect to the wild type are boxed, 
the origin of the mature lipase mutants is at base 163 or at base 162 in 
the wild type). 



Example 1 

In the following Example, the gene of the lipase from P. aeruginosa 
(isolation according to K.-E. Jager, Ruhr-Universitat Bochum) has been 
used for an optimization. The substrate for which the enantioselectivity of 
the lipase was to be improved was ( r R,S>2-methyldecanoic acid. A lipase 



- 19 - 



mutant with a preference for the (S>enantiomer was to be developed. 
The screening test was performed with (R)- and (S>2-methyl-decanoic 
acid p-nitrophenyl ester. 
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Formula 5 

o 

OH 

(T^SJ^-methyldecanoic acid 

Bacterial strains 
E. coli JM109: 

el4-(McrA), recAl, endhl, g/rA96, thi-1, /7sdR17(rK-mK+), 
Si/pE44, re/Al, A(lac-proAB), [F traA36 proAB /acl q ZAM15] 
(Stratagene) 

P. aeruginosa PABST7.1: 

lacUV5/lacI q controlled T7-polymerase gene stably integrated 
in the chromosome of strain P. aeruginosa PABS, which bears 
a deletion in the structural gene of lipase HpA (K.-E. Jaeger 
et al., J. Mol. Cat Part B, 1997, in press) 

Plasmids 

pMut5: BamHI/Apal fragment (1046 bp) of the P. aeruginosa lipase 
gene HpA in the vector pBluescript KSII (Stratagene) 

pUCPL6A: BamHI/Hindlll fragment (2.8 kbp) comprising the P. aerugi- 
nosa lipase operon in the vector pUCPKS (Watson etal., 
Gene 1996, 172, 163) under the control of the T7 promoter 
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Culturina of bacteria 

E. coli JM109 is grown over night (16 h) at 37 °C in 5 ml of LB medium on 
a test tube roller. For P. aeruginosa PABST7.1, 1 mM IPTG is added to the 
medium. For the screening test, P. aeruginosa PABST7.1 is grown in 
microtitration plates on a rotary shaker, the culture volume being 200 ul 
and the incubation being prolonged to 36-48 h. Antibiotics are added in 
the following concentrations: 

E. coli JM109: ampicillin 100 ug/ml; P. aeruginosa PABST7.1: carbenicillin 
200 ug/ml, tetracyclin 50 ug/ml 

Mutagenizinq PCR 

The lipase gene HpA is amplified using the plasmid pMut5 linearized with 
endonuclease Xmn I as a template and the following PCR primers: 

A: 5 ' - GCG CA ATTAACCCTC ACTAAAGG G AACA AA- 3 ' ; 

B : 5'-GCGTAATACGACTCACTATAGGGCGAA-3 ' 

After purification of the PCR product using a Qiagen Qiaquick Column®, it 
serves as a template in a mutagenic PCR. The reaction conditions are as 
follows: a 100 pi reaction volume contains 16.6 mM (NFU^SCk; 67 mM 
Tris-HCl (pH 8.8); 6.1 mM MgCI 2 ; 6.7 uM EDTA (pH 8.0); 0.2 mM dNTPs; 
10 mM mercaptoethanol; 10 pi of DMSO; 10 pmol each of the primers; 
0.1 ng of template DNA; and 1 U of Taq polymerase (Goldstar, Eurogen- 
tec). The reaction volume is covered with a layer of 100 pi of paraffin. Ten 
parallel reactions were performed which were combined after completion 
of the reaction. The cycling protocol is as follows: A 2 min denaturation at 
98 °C is followed by 25 cycles with 1 min at 94 °C, 2 min at 64 °C, 1 min 
at 72 °C on a Robocycler 40 (Stratagene), followed by incubation for 
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7 min at 72 °C. The Taq polymerase is added after the denaturation of the 
1st cycle. The sequencing of the PCR products yields an error rate of 
about 1-2 base substitutions per 1000 bp. 

Cloning of the PCR products 

The PCR products are precipitated with ethanol and resuspended in 
distilled water. After restriction with Apal and BamHI, the 1046 bp 
fragment formed is purified using a Qiagen Qiaquick Column® and ligated 
into the correspondingly prepared vector pUCPL6A using T4 DNA ligase 
(MBI Fermentas) for 2 h at room temperature. The reaction volume is 
diluted 1:5 and transformed into 200 ul of competent cells of E. co/i 
JM109 prepared by the method of Hanahan (J. Mol. Biol. 1983, 166, 557). 
For this purpose, the DNA and cells are stored on ice for 1 h and incu- 
bated with shaking at 42 °C for 2 min and, after the addition of 700 ul of 
LB medium, at 37 °C for 45 min. The cell suspension is subsequently 
plated onto LB (ampicillin 100 ug/ml) plates. Sixty nanograms of the PCR 
product employed in the ligation reaction will yield about 1500 colonies. 
All colonies are resuspended in sterile LB medium, the plasmid DNA is 
purified and transformed into P. aeruginosa PABST7.1 by electroporation 
according to the method of Farinha and Kropinski (FEMS Microbiol. Lett. 
1990, 70, 221). The 96 wells of the microtitration plates are inoculated 
with one colony each and treated as described in Culturing of bacteria. To 
obtain the culture supernatant, which is to be employed subsequently in 
the test for stereoselectivity, the microtitration plates are centrifuged at 
4000 rpm for 30 min. 

Test for stereoselectivity 

The lipase-containing culture supernatants obtained by centrifugation are 
pipetted in two aliquots into adjacent wells of a microtitration plate. The 
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test volume is 100 ul and is composed of the following components (Table 
2): 
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Table 2 



Composition of the reaction mixture in the test for improved enantiose- 
lectivity of lipase mutants 



After the addition of the Tris/HCI buffer to the supernatants, the microti- 
tration plate is incubated at 30 °C for about 5 min. After addition of the 
substrate solution, the reaction is continuously monitored for 10 min by 
spectrophotometry at 410 nm at 30 °C. From the linear rise of the ab- 
sorption curve, which is a measure of the constant initial rate of the 
hydrolysis, the apparent enantioselectivity (E a p P ) is determined. Thus, the 
slopes measured in the linear region of the initial rates of the reactions for 
the pair of enantiomers are divided by one another to obtain the value of 
the apparent enantioselectivity of the corresponding lipase mutant. 

Determination of stereoselectivity by gas chromatography 

Selected positive clones are grown in 5 ml liquid cultures (LB medium), 
and after centrifugation and removal of the bacterial pellet, the lipase- 
containing supernatant is employed for the reaction. As the substrate, 
100 pi of a solution of racemic (7R,S>2-methyldecanoic acid p-nitrophenyl 
ester (10 mg/ml in dimethylformamide) is used. This solution is admixed 



(R) reaction 



(S) reaction 



50 pi of culture supernatant 
40 pi of 10 mM Tris/HCI 
buffer, pH 7.5 
10 pi of substrate solution 
[10 mg/ml (R)-2-methyl- 
decanoic acid p-nitrophenyl 
ester in DMF] 



50 pi of culture supernatant 
40 pi of 10 mM Tris/HCI 
buffer, pH 7.5 
10 pi of substrate solution 
[10 mg/ml (S)-2-methyl- 
decanoic acid p-nitrophenyl 
ester in DMF] 
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with 700 Ml of 10 mM Tris/HCI buffer, pH 7.5. The reaction is started by 
adding 100 pi of culture supernatant and performed at 30 °C and 
1000 rpm in Eppendorf reaction vessels. After 2.5 h, samples of 200 ul 
each are removed and transferred to an Eppendorf vessel filled with 
200 ul of dichloromethane. After the addition of 25 ul of 20% aqueous 
hydrochloric acid, the products and educts are extracted (vortex shaker, 
1 min). Finally, the organic phase is used for gas-chromatographic ana- 
lysis (GC). Separation of the enantiomers of the free 2-methyldecanoic 
acid is achieved thereby. 

Separation conditions of GC: 

Instrument: Hewlett Packard 5890 

Column: 25 m 2.6 DM 3 Pent p-CD/80% SE 54 

Detector: FID 

Temperature: 230 °C inlet; 80-190 °C with 2 °C/min 

Gas: 0.6 bar hh 

Sample quantity: 0.1 ml 

Results (1st cycle) 

Of the about 1000 clones examined which had been obtained by muta- 
genizing PCR from the starting DNA (wild type gene of lipase from P. 
aeruginosa), 12 were identified to have an improved enantioselectivity 
over that of the corresponding wild type enzyme. Finally, 3 clones were 
selected and their enantioselectivity determined by GC analysis. 
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Table 3 



Selected lipase mutants with improved enantioselectivity (1st cycle) 



Mutant 


Vapp(S) 


Va P p(R) 


Eapp ) 


%ee 


E value 2 ) 




[mOD/min] 


[mOD/min] 




(by GC)/ 


(calculated 










% conversion 


from GC) 


Wild type 


21.8 


14.9 


1.5 


2.4/15.3 


1.1 


P1B01-E4 


128.4 


43.2 


3.0 


36.1/23.2 


2.4 


P1B01-F12 


78.8 


35.7 


2.2 


14.1 /30.5 


1.4 


P1B01-H1 


158.7 


56.2 


2.8 


37.6/4.5 


2.2 



1) Eapp = Vapp(S)/Vap P (R) 

2) E = ln[l-c(H-ee P )]/ln[l-c(l-eep)] with c = conversion, ee P = ee value of the 
product 



The DNA of the clone P1B 01-E4 served as the starting point for a new 
cycle of PCR mutagenization. Thus, the plasmid pUCPL6A was isolated 
from the clone and transformed into E. coli JM109 as described above. 
After the preparation of the plasmid DNA, the 1046 bp fragment was 
obtained by restriction with Apal and BamHI and subsequent purification 
and ligated into the correspondingly prepared plasmid pMut5. After 
transformation and plasmid isolation, this plasmid served as template 
DNA in a mutagenizing PCR under the conditions as described above. The 
DNA obtained from the mutagenizing PCR served to prepare a new 
mutant library (2nd generation). 

Results f2nd cvclel 

From the mutant library of the 2nd generation, about 2200 clones were 
used for the screening test. Ten mutants with an improved enantioselec- 
tivity over that of mutant P1B 01-E4 were identified. Two mutants (P2B 
04-G11 and P2B 08-H3) were examined more closely by GC analysis. 
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Table 4 



Selected lipase mutants with improved enantioselectivity (2nd cycle) 



Mutant 


Vapp(S) 


Vapp(R) 


Eapp ) 


% ee 


E value 2 ) 




[mOD/min] 


[mOD/min] 




(by GC)/ 


(calculated 










% conversion 


from GC) 


P2B04-G11 


224.9 


52.3 


4.3 


47.8 / 30.0 


3.4 


P2B 08-H3 


310.8 


67.4 


4.6 


56.6/19.3 


4.1 



1) Eapp = Va PP (S)/V app (R) 

2) E = ln[l-c(l+eep)]/ln[l-c(l-ee.p)] with c = conversion, ee? = ee value of the 
product 

Clone P2B 08-H3 was used for the next mutation cycle (3rd generation). 
Results (3rd cycle) 

From the mutant library of the 3rd generation, about 2400 clones were 
used for the screening test. One mutant (P3B 13-D10) with an improved 
enantioselectivity over that of mutant P2B 08-H3 was identified. It was 
examined further by GC analysis. 

Table 5 

Selected lipase mutants with improved enantioselectivity (3rd cycle) 



Mutant 


Vapp(S) 


Va PP (R) 


^app ) 


% ee 


E value 2 ) 




[mOD/min] 


[mOD/min] 




(by GC)/ 


(calculated 










% conversion 


from GC) 


P3B 13-D10 


240.0 


35.2 


6.9 


74.8 / 34.6 


10.2 



1) Eapp = Vapp(S)/Vapp(R) 

2) E = ln[l-c(l+eep)]/ln[l-c(l-eep)] with c = conversion, eep = ee value of the 
product 
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Results (4th cycled 

From the mutant library of the 4th generation, about 2000 clones were 
used for the screening test. Four mutants with an improved enantioselec- 
tivity over that of mutant P3B 13-D10 were identified. They were exam- 
ined further by GC analysis. 

Table 6 

Selected lipase mutants with improved enantioselectivity (4th cycle) 



Mutant 


Vapp(S) 


V a pp(R) 


Eapp ) 


%ee 


E value 2 ) 




[mOD/min] 


[mOD/min] 




(by GC)/ 


(calculated 










% conversion 


from GC) 


P4B 04-H3 


355.6 


26.5 


13.4 


81.0/20.0 


11.2 


P4B 01-F2 


162.4 


13.8 


11.7 


82.1 /5.0 


10.6 


P4B 15-G1 


315.4 


28.1 


11.2 


80.0/18.0 


10.7 


P4B 15-H7 


288.0 


25.1 


11.5 


78.4 / 22.0 


10.2 



1) Eapp = Vap P (S)/V app (R) 

2) E = ln[l-c(l+ee P )]/ln[l-c(l-ee P )] with c = conversion, ee P = ee value of the 
product 

The clone P4B04-H3 was inserted in the next mutation cycle (5 th generation). 
Results (5th cycled 

From the mutant library of the 5th generation, about 5200 clones were 
used for the screening test. Two mutants with an improved enantioselec- 
tivity over that of mutant P4B 04-H3 were identified. They were examined 
further by GC analysis. 
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Table 7 



Selected lipase mutants with improved enantioselectivity (5th cycle) 



Mutant 


Vapp(S) 


Vapp(R) 


Eapp ) 


% ee 


E value 2 ) 




[mOD/min] 


[mOD/min] 




(by GC)/ 


(calculated 










% conversion 


from GC) 


P5B 14-C1 1 


275.9 


17.3 


15.9 


77.0 / 43.0 


13.7 


P5B 08-F2 


124.0 


8.7 


14.3 


79.7/40.3 


15.1 



1) Eapp = Vapp(S)/Vapp(R) 

2) E = ln[l-c(l+eep)]/ln[l-c(l-eep)] with c = conversion, eep = ee value of the 
product 



Sequencing of the positive mutants 



By sequencing the positive mutants, the mutations could be localized 
within the lipase genes (see Figure 2). After assigning the base triplets to 
the corresponding amino acids, the following amino acid substitutions 
result with respect to the wild type lipase from P. aeruginosa: 



P1B 01-H1: T103I (Thrl03 -» Ilel03), S149G (Serl49 -» Glyl49) 
P1B 01-E4: S149G (Serl49 -» GK/149) 

P2B 08-H3: S149G (Serl49 -» Glyl49), S155L (Serl55 -> Leul55) 
P3B 13-D10: S149G (Serl49 -» Glyl49), S155L (Serl55 -» Leul55), 

V47G (Val47 ■» Gly47) 
P4B 04-H3: S149G (Serl49 -> Glyl49), S155L (Serl55 Leul55), 

V47G (Val47 Gly47), S33N (Ser33 -> Asn33), F259L 

(Phe259 -> Leu259) 
P5B 14-C11: S149G (Serl49 -> Glyl49), S155L (Serl55 -> Leul55), 

V47G (Val47 -> Gly47), S33N (Ser33 -> Asn33), F259L 

(Phe259 ■> Leu259), K110R (LysllO -> ArgllO) 
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Mutants P1B 01-E4, P2B 08-H3 and P3B 13-D10 were deposited on July 
16, 1997, with the DSMZ - Deutsche Sammlung von Mikroorganismen und 
Zellkulturen GmbH, D-38124 Braunschweig, Mascheroder Weg lb, under 
the designations of DSN! 11 658, DSM 11 659 and DSM 11 659, respec- 
tively. 

Mutants P5B 14-C11 and P4B 04-H3 were deposited on July 20, 1998, 
with the DSMZ - Deutsche Sammlung von Mikroorganismen und Zellkul- 
turen GmbH, D-38124 Braunschweig, Mascheroder Weg lb, under the 
designations of DSM 12 320 and DSM 12 322, respectively. 

Example 2 

The protocols for the culturing of the bacteria, the mutagenizing PCR and 
the test method for enantioselectivity are analogous to those of Example 
1. However, in this Example, the preparation of extensive mutant libraries 
is effected by in vitro recombination. 

The DNA used for the in vitro recombination is either generated by 
mutagenizing PCR or obtained by combining the DNA from any number of 
clones from one or more clone generations formed by repeated mutagen- 
izing PCR. If the PCR products of a mutagenizing PCR are the starting 
point for obtaining DNA for the in vitro recombination, the procedure is as 
follows: The PCR products of the mutagenizing PCR (see Example 1) are 
purified, cleaved with the restriction endonucleases Apa I and BamH I, 
ligated into the correspondingly cleaved vector pMUTS and then trans- 
formed into E. coii JM 109. The plasmid DNA from all transformation 
clones is isolated. If some number of selected clones from one or more 
generations of mutant clones are the starting point for obtaining DNA for 
the in vitro recombination, then the plasmid DNA of the vector pMUT5 is 
isolated and combined with the respective variants of the lipase gene of P. 
aeruginosa. In both cases, the further procedure is as follows: Restriction 
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with the endonuclease Pvu II yields a 1430 bp fragment which comprises 
the binding sites of primers A and B already used in the mutagenizing 
PCR, in addition to the structural gene for the lipase from P. aeruginosa. 
This fragment is purified and cleaved into randomly generated fragments 
by incubation with deoxyribonuclease I (DNase I from bovine pancreas). 
The size of the fragments and the error rate of the subsequent reassem- 
bling can be influenced by selecting the incubation conditions. 

DNase I treatment 

In a total volume of 100 ul, 3 ug of Pvu II fragments in 50 mM Tris/HCI, 
pH 7.5, 10 mM MgCb or 10 mM MnCb, respectively, and 50 ug/ml BSA is 
incubated at 23 °C with 0.075 U DNase I for 10-25 min or 1-10 min, 
respectively. The reaction is terminated by incubation at 93 °C for 10 min. 
Depending on the reaction time, fragments of smaller than 500 bp to 
smaller than 10 bp are obtained. In the case where only a particular 
range of sizes is used, these fragments can be obtained from agarose gels 
by selective electro-blotting on DEAE membrane (according to F.M. Aus- 
ubel et al. r eds., Current Protocols in Molecular Biology, John Wiley and 
Sons, 1989). After purification of the fragments by the Qiagen Nucleotide 
Removal Kit® (Qiagen), the following reassembling reaction is performed. 

Reassembling reaction 

10-30 ng of the fragments derived from the DNase I restriction are 
subjected to the following PCR cycles in 75 mM Tris/HCI, pH 9.0, 20 mM 
(NH4)2S04, 0.01% (w/v) Tween® 20, 1.5 mM MgCb, 0.2 mM dNTPs with 
2 U Goldstar Taq polymerase (Eurogentec) in a total volume of 50 ul: 
2 min at 94 °C, 40 cycles of 1 min at 94 °C, 2 min at 52 °C and 1 min at 
72 °C, finally 7 min at 72 °C. The Taq polymerase is added after the 1 
minute denaturing step of the 1st cycle. 
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PCR 



1 ul from the reassembling reaction is employed in a subsequent PCR 
reaction, which is composed as described for the reassembling reaction , 
with the following differences: instead of the DNase I generated frag- 
ments, 1 ul of the reassembling reaction is employed as the template 
DNA. In addition, primers A and B in a concentration of 0.2 mM and 10% 
dimethylsulfoxide are added. The cycling protocol is as follows: 

2 min at 98 °C, 30 cycles of 1 min at 9 4 °C, 2 min at 64 °C, 1 min at 72 °C 
and finally 7 min at 72 °C; parallel runs are performed. The PCR products 
formed in these reactions are purified, restricted with the Restriction 
endonucleases Apa I and Bam HI and cloned as described in the para- 
graph Mutagenizing PCR of Example 1. 

Results (in vitro recombination): 

Twelve clones of the 1st generation of the mutant library obtained by 
mutagenizing PCR (see Example 1) were used for the in vitro recombina- 
tion. The following clones which had shown improved enantioselectivity in 
the screening test were used: 

P1B 01-A2, P1B 01-A6, P1B 01-D2, P1B 01-D5, P1B 01-E1, P1B 01-E4, 
P1B 01-F3, P1B 01-F11, P1B 01-H1, P1B 01-H3, P1B 01-F12. 

The DNA of these clones recombined according to the procedure described 
above is cloned as stated in the paragraph Mutagenizing PCR, and the 
culture supernatants are employed in the test for enantioselectivity. About 
1000 recombinant clones were tested. The two identified recombinants 
S2A 01-E11 and S2A 02-G3 exhibit a significant improvement of enantio- 
selectivity over the best mutant of the 1st generation (P1B 01-E4) from 
Example 1. 
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Table 8 

Selected lipase mutants with improved enantioselectivity {in vitro recom- 
bination) 



Mutant 


Vapp(S) 


Vapp(R) 


Eapp ) 


% ee 


E value 2 ) 




[mOD/min] 


[mOD/min] 




(by GC)/ 


(calc.from 










% conversion 


GC) 


S2A 01-E11 


145.6 


41.6 


3.5 


41.0/27.0 


2.8 


S2A 02-G3 


210.8 


62.0 


3.4 


38.0 / 23.0 


2.5 



1) Eapp = Vapp(S)/Vapp(R) 

2) E = ln[l-c(l+eep)]/ln[l-c(l-eep)] with c = conversion, eep = ee value of the 
product 

Example 3 

Side-directed saturation mutagenesis in the amino acid position 155 of 
lipase mutant P3B 13-D10: 

Plasmids: 

pMut5 13D10: BamHI/Apal fragment (1046 bp) of the gene of mutant 
P3B 13D10 for the lipase from P. aeruginosa in pBlue- 
script KS II 

pMut5AAK 13D10: Deletion of the Afllll/Kpnl fragment in pMut5 13D10 

A fragment of the gene for the lipase from mutant P3B 13D10 is amplified 
using plasmid pMut5 13D10, linearized by endonuclease XmnI, and the 
following PCR primers: 

A: 5'-GCGCAATTAACCCTCACTAAAGGGAACAAA-3' 
M: 5'-GGTACGCAGAATNNNCTGGGCTCGC-3' 
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where N represents A or C or G or T. 

The reaction conditions are as follows: A 50 ul reaction volume contains 
75 mM Tris/HCI, pH 9.0 (at 25 °C); 20 mM (NH^SCk; 1.5 mM" MgClz; 
0.01% (w/v) Tween® 20; 10% (v/v) DMSO; 10 pmol of each of the 
primers; 0.1 ng of the template DNA; and 2 U of Taq polymerase (Gold- 
star, Eurogentec). The cycling protocol is as follows: A 2 min denaturation 
at 98 °C is followed by 30 cycles with 1 min at 94 °C, 2 min at 64 °C, 
1 min at 72 °C on a Robocycler 40 (Stratagene), followed by incubation 
for 7 min at 72 °C. The Taq polymerase is added after the denaturation of 
the 1st cycle. After purification of the PCR products by agarose gel 
electrophoresis and elution of the DNA from the agarose gel using the 
Nucleospin Extract Kit (Macherey & Nagel), it was used as a primer 
(socalled megaprimer) in a subsequent PCR. Thus, the lipase gene is 
amplified on the plasmid pMut5AAK 13D10, linearized by endonuclease 
XmnI, using the following PCR primers and the above described reaction 
conditions: 

A : 5'-GCGCAATTAACCCTCACTAAAGGGAACAAA-3' 
B (megaprimer): 5'-GCGTAATACGACTCACTATAGGGCGAA-3' 

The reaction conditions and the cycling protocol are as described above, 
except that 1-10 ng of the megaprimer is added to the reaction mixture. 
The cloning of the PCR products is effected as described under Cloning of 
the PCR Products. 

Results (saturation mutagenesis, 3rd generation, P3B13-D10) 

From the mutant library of the saturation mutagenesis (3rd generation, 
P3B 13-D10), about 900 clones were used for the screening test. One 
mutant (P4BSF 03-G10) with an improved enantioselectivity over that of 
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mutant P3B 13-D10 was identified. It was examined further by GC 
analysis. 

Table 9 

Selected lipase mutant with improved enantioselectivity (3rd generation, 
P3B 13-D10) 



Mutant 


Vapp(S) 

[mOD/min] 


Vapp(R) 

[mOD/min] 


Eapp ) 


% ee (by GC)/ 
% conversion 


E value 2 ) 
(calculated 
from GC) 


P4BSF 
03-G10 


384.7 


25.3 


15.2 


87.3/19.0 


20.4 



1) Eapp = Vap P (S)/Va P p(R) 

2) E = ln[l-c(l+eep)]/In[l-c(l-eep)] with c = conversion, ee? = ee value of the 
product 
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Sequencing of the positive mutants 

By sequencing the positive mutants, the mutations could be localized 
within the lipase gene (see Figure 2). After assigning the base triplets to 
the corresponding amino acids, the following amino acid substitution 
resulted with respect to mutant P3B 13-D10: 

P4BSF 03-G10 : L155F (Leul55 -» Phel55) 

Mutant P4BSF 03-G10 was deposited on July 20, 1998, with the DSMZ - 
Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH, D- 
38124 Braunschweig, Mascheroder Weg lb, under the designation of DSM 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Studiengesellschaf t Kohle mbH 

(B) STREET: Kaiser-Wilhelm-Platz 1 

(C) CITY: Muelheim an der Ruhr 

(E) COUNTRY: Germany 

(F) POSTAL CODE (ZIP): 45470 

(ii) TITLE OF INVENTION: A Process for the Preperation and 
Identification of Novel Hydrolases Having Improved 
Properties 

(iii) NUMBER OF SEQUENCES: 21 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GCGCAATTAA CCCTCACTAA AGGGAACAAA 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GCGTAATACG ACTCACTATA GGGCGAA 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 85. .1017 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 163. .1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 60 

CCATCAACCT GAGAT GAGAA CAAC ATG AAG AAG AAG TAT CTG CTC CCC CTC 111 

Met Lys Lys Lys Tyr Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 159 
Gly Leu Ala He Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu He Gin 
-15 -10 -5 

GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 207 
Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro He Val Leu Ala His Gly 
15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTC GGG GTC GAC TAC TGG TTC GGC ATT 255 
Met Leu Gly Phe Asp Asn He Leu Gly Val Asp Tyr Trp Phe Gly He 

20 25 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GTC 303 
Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Val 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 351 
Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 3 99 
Val Glu Glu He Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu He 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 4 47 
Gly His Ser His Gly Gly Pro Thr He Arg Tyr Val Ala Ala Val Arq 

80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ATC AGC GTC GGC GCC CCG CAC AAG GGT 4 95 
Pro Asp Leu He Ala Ser Ala He Ser Val Gly Ala Pro His Lys Gly 
100 105 no 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 5 43 
Ser Asp Thr Ala Asp Phe Leu Arg Gin He Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 5 91 
Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu He Ser 
130 135 140 
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TTC CTT TCC AGC GGC GGC ACC GGT ACG CAG AAT TCA CTG GGC TCG CTG 639 

Phe Leu Ser Ser Gly Gly Thr Gly Thr Gin Asn Ser Leu Gly Ser Leu 

145 150 155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAC CCG 687 
Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 
160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCC TAC AAG GTC AAC 735 
Gin Gly He Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 185 190 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 783 
Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
!95 200 205 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 831 
Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lvs 
210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 879 
Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 
225 230 235 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 927 
Gly Met Val He Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 
240 245 250 255 

AAC CAG GTC TTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 975 
Asn Gin Val Phe Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 
27 5 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 1049 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Lys Lys Lys Tyr Leu Leu Pro Leu Gly Leu Ala He Gly Leu Ala 
-26 -25 -20 -15 

Ser Leu Ala Ala Ser Pro Leu He Gin Ala Ser Thr Tyr Thr Gin Thr 
-10 -5 1 5 

Lys Tyr Pro He Val Leu Ala His Gly Met Leu Gly Phe Asp Asn He 
10 15 20 

Leu Gly Val Asp Tyr Trp Phe Gly He Pro Ser Ala Leu Arg Arg Asp 
25 30 35 
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Gly Ala Gin Val Tyr Val Thr Glu Val Ser Gin Leu Asp Thr Ser Glu 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu He Val Ala Leu 
55 60 65 70 

Ser Gly Gin Pro Lys Val Asn Leu He Gly His Ser His Gly Gly Pro 
75 80 85 

Thr He Arg Tyr Val Ala Ala Val Arg Pro Asp Leu He Ala Ser Ala 
90 95 100 

He Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr Ala Asp Phe Leu 
105 no us 

Arg Gin He Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 

Val Asn Ser Leu Gly Ala Leu He Ser Phe Leu Ser Ser Gly Gly Thr 
135 140 145 150 

Gly Thr Gin Asn Ser Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 160 165 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly He Pro Thr Ser Ala 
170 175 180 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Tro 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 210 

Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
2i5 220 225 230 

Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val He Arg Asp Asn 
235 240 245 

Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Phe Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 85. . 1017 



- 41 - 



(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 163. .1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 60 

CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 111 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 159 
Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 

GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 207 
Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 
15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTC GGG GTC GAC TAC TGG TTC GGC ATT 255 
Met Leu Gly Phe Asp Asn lie Leu Gly Val Asp Tyr Trp Phe Gly lie 
20 25 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GTC 303 
Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Val 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 351 
Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 399 
Val Glu Glu lie Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu lie 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 4 47 
Gly His Ser His Gly Gly Pro Thr He Arg Tyr Val Ala Ala Val Arg 
80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AAG GGT 4 95 
Pro Asp Leu He Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 
100 105 110 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 543 
Ser Asp Thr Ala Asp Phe Leu Arg Gin He Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 5 91 
Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu He Ser 
130 135 140 

TTC CTT TCC AGC GGC GGC ACC GGT ACG CAG AAT TCA CTG GGC TCG CTG 639 
Phe Leu Ser Ser Gly Gly Thr Gly Thr Gin Asn Ser Leu Gly Ser Leu 
145 150 155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAC CCG 687 
Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 
160 165 170 175 
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CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCC TAC AAG GTC AAC 735 
Gin Gly lie Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 185 190 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 783 
Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 200 205 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 831 
Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 
210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 87 9 
Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 
225 230 235 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 927 
Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 
240 245 250 255 

AAC CAG GTC TTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 975 
Asn Gin Val Phe Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 
275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 1049 



(2) INFORMATION FOR SEQ ID NO: 6: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 
-26 -25 -20 -15 

Ser Leu Ala Ala Ser Pro Leu lie Gin Ala Ser Thr Tyr Thr Gin Thr 
-10 -5 15 

Lys Tyr Pro lie Val Leu Ala His Gly Met Leu Gly Phe Asp Asn lie 
10 15 20 

Leu Gly Val Asp Tyr Trp Phe Gly lie Pro Ser Ala Leu Arg Arg Asp 
25 30 35 

Gly Ala Gin Val Tyr Val Thr Glu Val Ser Gin Leu Asp Thr Ser Glu 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu lie Val Ala Leu 
55 60 65 70 

Ser Gly Gin Pro Lys Val Asn Leu lie Gly His Ser His Gly Gly Pro 
75 80 85 
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Thr lie Arg Tyr Val Ala Ala Val Arg Pro Asp Leu lie Ala Ser Ala 
90 95 100 

Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr Ala Asp Phe Leu 
105 110 115 

Arg Gin lie Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 

Val Asn Ser Leu Gly Ala Leu lie Ser Phe Leu Ser Ser Gly Gly Thr 
135 140 145 150 

Gly Thr Gin Asn Ser Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 160 165 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly lie Pro Thr Ser Ala 
170 175 180 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Trp 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 210 

Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220 225 230 

Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val lie Arg Asp Asn 
235 240 245 

Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Phe Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 



Arg Leu Lys Asn Ala Ser Leu 
280 285 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 85. . 1017 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 163. . 1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 60 
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CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 111 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 15 9 
Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gln_ 
-15 -10 -5 

GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 207 
Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 
15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGG TTC GGC ATT 255 
Met Leu Gly Phe Asp Asn lie Leu Gly Val Asp Tyr Trp Phe Gly lie 
20 25 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GTC 303 
Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Val 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 351 
Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 399 
Val Glu Glu lie Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu lie 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 4 47 
Gly His Ser His Gly Gly Pro Thr lie Arg Tyr Val Ala Ala Val Arg 
80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AAG GGT 4 95 
Pro Asp Leu lie Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 
100 105 110 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 543 
Ser Asp Thr Ala Asp Phe Leu Arg Gin lie Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 5 91 
Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu lie Ser 
130 135 140 

TTC CTT TCC AGC GGC GGC ACC GGT ACG CAG AAT TTA CTG GGC TCG CTG 639 
Phe Leu Ser Ser Gly Gly Thr Gly Thr Gin Asn Leu Leu Gly Ser Leu 
145 150 155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAC CCG 687 
Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 
160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCC TAC AAG GTC AAC 7 35 
Gin Gly lie Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 185 190 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 7 83 
Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 200 205 
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CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 831 
Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 
210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 87 9 
Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 
225 230 235 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 927 
Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 
240 245 250 255 

AAC CAG GTC TTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 975 
Asn Gin Val Phe Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 
275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 1049 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 
-26 -25 -20 -15 

Ser Leu Ala Ala Ser Pro Leu lie Gin Ala Ser Thr Tyr Thr Gin Thr 
-10 -5 15 



Lys Tyr Pro lie Val Leu Ala His Gly Met Leu Gly Phe Asp Asn lie 
10 15 20 

Leu Gly Val Asp Tyr Trp Phe Gly lie Pro Ser Ala Leu Arg Arg Asp 
25 30 35 

Gly Ala Gin Val Tyr Val Thr Glu Val Ser Gin Leu Asp Thr Ser Glu 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu lie Val Ala Leu 
55 60 65 70 

Ser Gly Gin Pro Lys Val Asn Leu lie Gly His Ser His Gly Gly Pro 
75 80 85 

Thr lie Arg Tyr Val Ala Ala Val Arg Pro Asp Leu lie Ala Ser Ala 
90 95 100 

Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr Ala Asp Phe Leu 
105 110 115 



Arg Gin lie Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 
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Val Asn Ser Leu Gly Ala Leu lie Ser Phe Leu Ser Ser Gly Gly Thr 
135 140 145 150 

Gly Thr Gin Asn Leu Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 160 165 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly lie Pro Thr Ser Ala 
170 175 180 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Trp 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 210 

Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220 225 230 

Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val lie Arg Asp Asn 
235 240 245 

Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Phe Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 84. .1016 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 162. .1016 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GGATCCCCGG TTCTCCCGGA AGGATTCGGG CGATGGCTGG CAGGACGCGC CCCTCGGCCC 60 

CATCAACCTG AGATGAGAAC AAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 110 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 158 
Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 



47 



GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 206 
Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro He Val Leu Ala His Gly 
15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTC GGG GTC GAC TAC TGG TTC GGC ATT 254 
Met Leu Gly Phe Asp Asn He Leu Gly Val Asp Tyr Trp Phe Gly He 

20 25 30 

CCC' AGC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GTC 302 
Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Val 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 350 
Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 398 
Val Glu Glu He Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu He 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 44 6 
Gly His Ser His Gly Gly Pro Thr He Arg Tyr Val Ala Ala Val Arg 
80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AAG GGT 4 94 
Pro Asp Leu He Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 
100 105 110 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 542 
Ser Asp Thr Ala Asp Phe Leu Arg Gin He Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 590 
Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu He Ser 
130 135 140 

TTC CTT TCC AGC GGC AGC ACC GGT ACG CAG AAT TCA CTG GGC TCG CTG 638 
Phe Leu Ser Ser Gly Ser Thr Gly Thr Gin Asn Ser Leu Gly Ser Leu 
145 150 155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAC CCG 68 6 
Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 
160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCC TAC AAG GTC AAC 734 
Gin Gly He Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 185 190 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 7 82 
Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 200 205 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 830 
Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 
210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 878 
Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 
225 230 235 
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GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 92 6 
Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 
240 245 250 255 

AAC CAG GTC TTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 97 4 

Asn Gin Val Phe Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1016 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 
275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC C 1047 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala He Gly Leu Ala 

-26 -25 -20 -15 

Ser Leu Ala Ala Ser Pro Leu He Gin Ala Ser Thr Tyr Thr Gin Thr 

-10 -5 15 

Lys Tyr Pro He Val Leu Ala His Gly Met Leu Gly Phe Asp Asn He 
10 15 20 

Leu Gly Val Asp Tyr Trp Phe Gly He Pro Ser Ala Leu Arg Arg Asp 
25 30 35 

Gly Ala Gin Val Tyr Val Thr Glu Val Ser Gin Leu Asp Thr Ser Glu 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu He Val Ala Leu 
55 60 65 70 

Ser Gly Gin Pro Lys Val Asn Leu He Gly His Ser His Gly Gly Pro 
75 80 85 

Thr He Arg Tyr Val Ala Ala Val Arg Pro Asp Leu He Ala Ser Ala 
90 95 100 

Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr Ala Asp Phe Leu 
105 110 115 

Arg Gin He Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 

Val Asn Ser Leu Gly Ala Leu He Ser Phe Leu Ser Ser Gly Ser Thr 
135 140 145 150 

Gly Thr Gin Asn Ser Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 160 165 



- 49 - 



Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly He Pro Thr Ser Ala 
170 175 180 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Trp 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 210 

Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220 225 230 

Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val He Arg Asp Asn 
235 240 245 

Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Phe Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B ) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 85. .1017 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 163. .1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 



60 



CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 111 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 159 
Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu He Gin 

-10 -5 

GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 207 
Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro He Val Leu Ala His Gly 
1 5 10 15 

ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGG TTC GGC ATT 255 
Met Leu Gly Phe Asp Asn He Leu Gly Val Asp Tyr Trp Phe Gly He 

20 25 30 



50 



CCC AGC 
Pro Ser 



AGC CAG 
Ser Gin 



GTG GAG 
Val Glu 
65 

GGC CAC 
Gly His 
80 

CCC GAC 
Pro Asp 



TCG 
Ser 



GAG 
Glu 



TTC 
Phe 



GAG 
Glu 
160 

CAG 
Gin 



GAC 
Asp 



GCA 
Ala 



CTT 
Leu 
145 

TCG 
Ser 



GCC TTG 
Ala Leu 
35 

TTG GAC 
Leu Asp 
50 

GAA ATC 
Glu He 



AGC CAC 
Ser His 



CTG ATC 
Leu He 



ACC GCC 
Thr Ala 
115 

GTC CTC 
Val Leu 
130 

TCC AGC 
Ser Ser 



CTG AAC 
Leu Asn 



CGC CGT GAC GGT 
Arg Arg Asp Gly 



GGC ATC CCC 
Gly He Pro 



GGC GTG AGC 
Gly Val Ser 



CTC GAT 
Leu Asp 



AAC GGC 
Asn Gly 
225 

GGC ATG 
Gly Met 
240 

AAC CAG 
Asn Gin 



CCG 
Pro 
210 

ACC 
Thr 



GTG 
Val 



GTC 
Val 



TAT 
Tyr 
195 

AGC 
Ser 



GCC 
Ala 



ATC 
He 



TTC 
Phe 



ACC 
Thr 



GTC 
Val 



GGC 
Gly 



GCT 
Ala 
100 

GAC 
Asp 



TCC 
Ser 



GGC 
Gly 



AGC 
Ser 



ACC 
Thr 
180 

TAC 
Tyr 



GAC 
Asp 



AAC 
Asn 



CGC 
Arg 



GGC 
Gly 
260 



TCG GAA GTC 
Ser Glu Val 
55 

GCC CTC AGC 
Ala Leu Ser 
70 

GGG CCG ACC 
Gly Pro Thr 
85 

TCC GCC ACC 
Ser Ala Thr 



TTC CTG CGC 
Phe Leu Arg 



GGG CTG GTC 
Gly Leu Val 
135 

GGC ACC GGT 
Gly Thr Gly 
150 

GAG GGT GCC 
Glu Gly Ala 
165 

TCG GCC TGC 
Ser Ala Cys 



GCC CAG GTC 
Ala Gin Val 
40 

CGC GGC GAG 
Arg Gly Glu 



GGC CAG CCC 
Gly Gin Pro 



ATC CGC 
He Arg 



AGC GTC 
Ser Val 
105 

CAG ATC 
Gin He 
120 

AAC AGC 
Asn Ser 



TAC 
Tyr 
90 

GGC 
Gly 



CCA 
Pro 



CTC 
Leu 



ACG CAG AAT 
Thr Gin Asn 



TCC TGG AGC 
Ser Trp Ser 



GCC 
Ala 



GAC 
Asp 



GAC 
Asp 
245 

CTC 
Leu 



TTC 
Phe 



GGC 
Gly 
230 

AAC 
Asn 



CTC 
Leu 
215 

CTG 
Leu 



TAC 
Tyr 



GCG CGC 
Ala Arg 



GGC GAA 
Gly Glu 
185 

GGT TCC 
Gly Ser 
200 

GGC GCC 
Gly Ala 



TTC 
Phe 
170 

GGC 
Gly 



TCG 
Ser 



TCG 

Ser 



TAC GTC 
Tyr Val 



CAG TTG 
Gin Leu 
60 

AAG GTC 
Lys Val 
75 

GTC GCC 
Val Ala 



GCC CCG 
Ala Pro 



CCG GGT 
Pro Gly 



GGC GCG 
Gly Ala 
140 

TTA CTG 
Leu Leu 
155 

AAC GCC 
Asn Ala 



GCC TAC 
Ala Tyr 



ACC GAA GGC 
Thr Glu Gly 
45 

CTG CAA CAG 
Leu Gin Gin 



AAC CTG ATC 
Asn Leu He 



GCC 
Ala 



CAC 
His 



TCG 
Ser 
125 

CTG 
Leu 



GTA CGT 
Val Arg 
95 

AAG GGT 
Lys Gly 
110 

GCC GGC 
Ala Gly 



ATC AGC 
He Ser 



GGC TCG CTG 
Gly Ser Leu 



AAG 
Lys 



AAG 
Lys 



CCG CTG 
Pro Leu 



ACC AGC 
Thr Ser 



GTC GGC ACC 
Val Gly Thr 



CGG ATG AAC 
Arg Met Asn 
250 

CTG TTC GAG 
Leu Phe Glu 
265 



TCG 
Ser 



TGC 
Cys 
235 

CAC 
His 



CTG 
Leu 
220 

AGT 
Ser 



CTG 
Leu 



ACC 
Thr 
205 

ACC 
Thr 



TAC CCG 
Tyr Pro 
175 

GTC AAC 
Val Asn 
190 

AAC TTC 
Asn Phe 



TTC AAG 
Phe Lys 



ACC AGC 
Thr Ser 



TCG CAC CTG 
Ser His Leu 



GAC GAG GTG 
Asp Glu Val 
255 

CCG GTC AGC 
Pro Val Ser 
270 



303 



351 



399 



447 



495 



543 



591 



639 



687 



735 



783 



831 



879 



927 



975 
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GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 

Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 

275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 1049 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 
-26 -25 -20 -15 

Ser Leu Ala Ala Ser Pro Leu lie Gin Ala Ser Thr Tyr Thr Gin Thr 
-10 -5 1 5 . 

Lys Tyr Pro lie Val Leu Ala His Gly Met Leu Gly Phe Asp Asn lie 
10 15 20 

Leu Gly Val Asp Tyr Trp Phe Gly lie Pro Ser Ala Leu Arg Arg Asp 
25 30 35 

Gly Ala Gin Val Tyr Val Thr Glu Gly Ser Gin Leu Asp Thr Ser Glu 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu lie Val Ala Leu 
55 60 65 70 

Ser Gly Gin Pro Lys Val Asn Leu lie Gly His Ser His Gly Gly Pro 
75 80 85 

Thr lie Arg Tyr Val Ala Ala Val Arg Pro Asp Leu lie Ala Ser Ala 
90 95 100 

Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr Ala Asp Phe Leu 
105 110 115 

Arg Gin lie Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 

Val Asn Ser Leu Gly Ala Leu lie Ser Phe Leu Ser Ser Gly Gly Thr 
135 140 145 150 

Gly Thr Gin Asn Leu Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 160 165 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly lie Pro Thr Ser Ala 
170 175 180 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Trp 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 210 
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Leu Gly Ala Ser 
215 

Leu Val Gly Thr 



Tyr Arg Met Asn 
250 



Ser Leu Phe Glu 
265 



Arg Leu Lys Asn 
280 



Ser Leu Thr Phe 

220 

Cys Ser Ser His 
235 

His Leu Asp Glu 



Thr Ser Pro Val 
270 

Ala Ser Leu 
285 



Lys Asn Gly Thr 
225 

Leu Gly Met Val 
240 

Val Asn Gin Val 
255 

Ser Val Tyr Arg 



Ala Asn Asp Gly 
230 

lie Arg Asp Asn 
245 

Phe Gly Leu Thr 
260 

Gin His Ala Asn 
275 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1050 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 85, ,1017 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 163. .1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 60 

CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 111 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 15 9 
Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 

GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 207 
Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 
15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGG TTC GGC ATT 255 
Met Leu Gly Phe Asp Asn lie Leu Gly Val Asp Tyr Trp Phe Gly lie 
20 25 30 

CCC AAC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GGC 303 
Pro Asn Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Gly 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 351 
Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 
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GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 399 
Val Glu Glu lie Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu lie 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 4 47 
Gly His Ser His Gly Gly Pro Thr lie Arg Tyr Val Ala Ala Val Arg 
80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AAG GGT 4 95 
Pro Asp Leu lie Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 
100 105 110 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 543 
Ser Asp Thr Ala Asp Phe Leu Arg Gin lie Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 591 
Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu lie Ser 
130 135 140 

TTC CTT TCC AGC GGC GGC ACC GGT ACG CAG AAT TTA CTG GGC TCG CTG 639 
Phe Leu Ser Ser Gly Gly Thr Gly Thr Gin Asn Leu Leu Gly Ser Leu 
145 150 155 

' GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAC CCG 687 
Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 
160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCT TAC AAG GTC AAC 7 35 
Gin Gly lie Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 185 190 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 783 
Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 200 205 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 831 
Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 
210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 87 9 
Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 
225 230 235 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 927 
Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 
240 245 250 255 

AAC CAG GTC CTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 975 
Asn Gin Val Leu Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 
275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CCG 1050 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 
-26 -25 -20 -15 

Ser Leu Ala Ala Ser Pro Leu lie Gin Ala Ser Thr Tyr Thr Gin Thr 
-10 -5 15 

Lys Tyr Pro lie Val Leu Ala His Gly Met Leu Gly Phe Asp Asn lie 
10 15 20 

Leu Gly Val Asp Tyr Trp Phe Gly lie Pro Asn Ala Leu Arg Arg Asp 
25 30 35 

Gly Ala Gin Val Tyr Val Thr Glu Gly Ser Gin Leu Asp Thr Ser Glu 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu He Val Ala Leu 
55 GO 65 70 

Ser Gly Gin Pro Lys Val Asn Leu He Gly His Ser His Gly Gly Pro 
75 80 85 

Thr He Arg Tyr Val Ala Ala Val Arg Pro Asp Leu He Ala Ser Ala 
90 95 100 

Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr Ala Asp Phe Leu 
105 110 115 

Arg Gin He Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 

Val Asn Ser Leu Gly Ala Leu He Ser Phe Leu Ser Ser Gly Gly Thr 
135 140 145 150 

Gly Thr Gin Asn Leu Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 160 165 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly He Pro Thr Ser Ala 
170 175 180 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Trp 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 210 

Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220 225 230 

Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val He Arg Asp Asn 
235 240 245 
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Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Leu Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 85, . 1017 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 163. .1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 60 

CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 111 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 15 9 
Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 

GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 207 
Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 
15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGG TTC GGC ATT 255 
Met Leu Gly Phe Asp Asn lie Leu Gly Val Asp Tyr Trp Phe Gly lie 
20 25 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GGC 303 
Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Gly 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 351 
Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 



GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 
Val Glu Glu lie Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu He 
65 70 75 
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GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 4 47 
Gly His Ser His Gly Gly Pro Thr lie Arg Tyr Val Ala Ala Val Arg 
80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AGG GGT 495 
Pro Asp Leu lie Ala Ser Ala Thr Ser Val Gly Ala Pro His Arg Gly 
100 105 110 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 543 
Ser Asp Thr Ala Asp Phe Leu Arg Gin lie Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 591 
Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu lie Ser 
130 135 140 

TTC CTT TCC AGC GGC GGC ACC GGT ACG CAG AAT TTA CTG GGC TCG CTG 639 
Phe Leu Ser Ser Gly Gly Thr Gly Thr Gin Asn Leu Leu Gly Ser Leu 
145 150 155 

GAG TCG CTG AAC AGT GAG GGT GCC GCG CGC TTC AAC GCC AAG TAC CCG 687 
Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro- 
160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCT TAC AAG GTC AAC 735 
Gin Gly lie Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 185 190 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 7 83 
Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 200 205 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 831 
Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 
210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 87 9 

Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 
225 230 235 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 927 
Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 
240 245 250 255 

AAC CAG GTC CTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 975 
Asn Gin Val Leu Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 
275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 104 9 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 

-26 -25 -20 -15 

Ser Leu Ala Ala Ser Pro Leu lie Gin Ala Ser Thr Tyr Thr Gin Thr 

-10 -5 15 

Lys Tyr Pro lie Val Leu Ala His Gly Met Leu Gly Phe Asp Asn lie 
10 15 20 

Leu Gly Val Asp Tyr Trp Phe Gly lie Pro Ser Ala Leu Arg Arg Asp 
25 30 35 

Gly Ala Gin Val Tyr Val Thr Glu Gly Ser Gin Leu Asp Thr Ser Glu 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu lie Val Ala Leu 
55 60 65 70 

Ser Gly Gin Pro Lys Val Asn Leu lie Gly His Ser His Gly Gly Pro 
75 80 85 

Thr lie Arg Tyr Val Ala Ala Val Arg Pro Asp Leu lie Ala Ser Ala 
90 95 100 

Thr Ser Val Gly Ala Pro His Arg Gly Ser Asp Thr Ala Asp Phe Leu 
105 110 115 

Arg Gin lie Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 

Val Asn Ser Leu Gly Ala Leu lie Ser Phe Leu Ser Ser Gly Gly Thr 
135 140 145 150 

Gly Thr Gin Asn Leu Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 160 165 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly lie Pro Thr Ser Ala 
170 175 180 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Trp 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 210 

Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220 225 230 

Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val lie Arg Asp Asn 
235 240 245 

Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Leu Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 85. . 1017 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 163. .1017 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 60 

CCATCAACCT GAG AT G AGAA CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 111 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 15 9 
Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 

GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 2 07 
Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 
15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGG TTC GGC ATT 255 
Met Leu Gly Phe Asp Asn lie Leu Gly Val Asp Tyr Trp Phe Gly lie 
20 25 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GGC 303 
Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Gly 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 351 
Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 3 99 
Val Glu Glu lie Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu lie 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 4 47 

Gly His Ser His Gly Gly Pro Thr lie Arg Tyr Val Ala Ala Val Arg 
80 85 90 95 



CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AAG GGT 
Pro Asp Leu He Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 
100 105 110 
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TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 543 
Ser Asp Thr Ala Asp Phe Leu Arg Gin lie Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 591 
Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu lie Ser 
130 135 140 

TTC CTT TCC AGC GGC GGC ATC GGT ACG CAG AAT TTT CTG GGC TCG CTG 639 
Phe Leu Ser Ser Gly Gly lie Gly Thr Gin Asn Phe Leu Gly Ser Leu 
145 150 155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAC CCG 687 
Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 
160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCC TAC AAG GTC AAC 735 
Gin Gly lie Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 185 190 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 783 
Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 200 205 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 831 
Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 
210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 87 9 
Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 
225 230 235 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 927 
Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 
240 245 250 255 

AAC CAG GTC TTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 975 
Asn Gin Val Phe Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 
275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 1049 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 
-26 -25 -20 -15 



Ser Leu Ala Ala Ser Pro Leu lie Gin Ala Ser Thr Tyr Thr Gin Thr 
-10 -5 15 
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Lys Tyr Pro lie Val Leu Ala His Gly Met Leu Gly Phe Asp Asn lie 
10 15 20 

Leu Gly Val Asp Tyr Trp Phe Gly lie Pro Ser Ala Leu Arg Arg Asp 
25 30 35 

Gly Ala Gin Val Tyr Val Thr Glu Gly Ser Gin Leu Asp Thr Ser Glu 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu He Val Ala Leu 
55 60 65 70 

Ser Gly Gin Pro Lys Val Asn Leu He Gly His Ser His Gly Gly Pro 
75 80 85 

Thr He Arg Tyr Val Ala Ala Val Arg Pro Asp Leu He Ala Ser Ala 
90 95 100 

Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr Ala Asp Phe Leu 
105 110 115 

Arg Gin He Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 

Val Asn Ser Leu Gly Ala Leu He Ser Phe Leu Ser Ser Gly Gly He 
135 140 145 150 

Gly Thr Gin Asn Phe Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 160 165 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly He Pro Thr Ser Ala 
170 175 180 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Trp 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 210 

Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220 225 230 

Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val He Arg Asp Asn 
235 240 245 

Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Phe Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GCGCAATTAA CCCTCACTAA AGGGAACAAA 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GGTACGCAGA ATNNNCTGGG CTCGC 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GCGTAATACG ACTCACTATA GGGCGAA 
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CLAIMS: 



A process for the preparation and identification of hydrolase mu- 
tants having improved properties with respect to stereo- or regiose- 
lectivity, catalytic activity or stability, characterized in that 

a) a starting hydrolase gene is mutagenized by a modified poly- 
merase chain reaction (PCR), wherein the mutation rate and total 
number of mutations in the amplified DNA is adjusted by adjusting 
the concentrations of Mg 2+ , Mn 2+ and of the deoxynucleotides and by 
adjusting the number of cycles; and/or 

b) one or more starting hydrolase genes, one or more hydrolase 
genes mutated according to step a), or mixtures of one or more 
starting hydrolase genes and one or more hydrolase genes mutated 
according to step a) are mutagenized by enzymatically fragmenting 
said genes, followed by enzymatic reassembly of the fragments pro- 
duced to give complete recombinant hydrolase genes; 

c) the mutated hydrolase genes obtained according to step a) or b) 
are transformed into a host organism; and 

d) hydrolase mutants having improved properties, expressed by 
transformants obtained in step c), are identified by a test method. 

The process according to claim 1, wherein an average mutation rate 
of 1-2 base substitutions, per one hydrolase gene to be mutagenized, 
is adjusted in the PCR in step a) by adjusting the concentrations of 
Mg 2+ , Mn 2+ and of the deoxynucleotides. 
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3. The process according to claim 1, wherein a hydrolase gene muta- 
genized in a PCR previously performed according to claim 1 is used as 
the starting hydrolase gene in step a). 

4. The process according to claim 1, wherein the enzymatic fragmenta- 
tion of the hydrolase genes in step b) is performed using a deoxyri- 
bonuclease. 

5. The process according to claim 1, wherein the reassembly of the 
fragments in step b) is effected enzymatically by means of a thermo- 
stable DNA polymerase using temperature cycles in which the para- 
meters of temperature and duration of cycles are adjusted. 

6. The process according to claim 1, wherein the mutation rate is ad- 
justed during the enzymatic reassembly in step b) by adjusting the 
concentrations of Mg 2+ , Mn 2+ and of the deoxynucleotides. 

7. The process according to claim 1, wherein the completely recombined 
hydrolase genes are amplified by a polymerase chain reaction in step 
b) after completion of the reassembly reaction. 

8. The process according to claim 1, wherein either modified hydrolase 
genes obtained from step a) according to claim 1 or 2 or several hy- 
drolase genes mutagenized according to claim 3 are subjected to 
fragmentation and reassembly in step b). 

9. The process according to claim 1, wherein synthetically prepared 
gene fragments are additionally used for the reassembly in step b). 

10. The process according to claim 1, wherein hydrolase gene fragments 
from different organisms sharing a sequence homology of at least 
60% can be used for the reassembly in step b). 
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11. The process according to claim 2 or 6, wherein the hydrolase mutants 
are lipase or esterase mutants, and the concentration of the magne- 
sium ions is from 1.5 to 8.0 mM, preferably from 5.8 to 6.4 mM, and 
the concentration of the manganese ions is from 0.0 to 3.0 mM, pref- 
erably < 0.3 mM. 

12. The process according to claim 2 or 6, wherein the hydrolase mutants 
are lipase or esterase mutants, and the concentration of the deoxy- 
nucleotide triphosphates is from 0.05 to 1.0 mM, preferably 0.2 mM. 

13. The process according to claim 1, wherein for the test for stereo- or 
regioselectivity of the hydrolase mutants in step d), a test substrate 
is provided with a chromophorous group which causes a spec- 
trometrically determined change of absorption or emission upon 
cleavage by the catalyst, and equal amounts of the hydrolase mu- 
tants are added to the pure stereo- or regioisomers of the test sub- 
strate in separate test vessels, and the stereo- or regioselectivity 
can be determined from the ratio of the linear initial reaction rates 
obtained. 

14. The process according to claim 13, wherein the stereo- or regioi- 
somers of a compound with a UV/VIS-active or fluorescence-active 
molecular group bound through a carboxylic acid ester or carboxylic 
acid amide linkage are used as the test substrate. 

15. The process according to claim 14, wherein said UV/VIS-active 
molecular group is a p-nitrophenyl residue. 

16. The process according to claim 1, wherein the test for stereo- or 
regioselectivity in step d) is effected through determination of the 
change of concentration with time of free fatty acids or succinic acid, 
wherein the corresponding stereo- or regioisomeric carboxylic acid 
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esters or amides are hydrolyzed in separate vessels by means of the 
hydrolase mutants to give free fatty acids or succinic acid. 

17. The process according to claim 1, wherein the test for stereo- or 
regioselectivity in step d) is effected through measuring the radioac- 
tivity, wherein the hydrolase mutants are reacted with stereo- or re- 
gioisomers having different radioactive labels in one functional group, 
and wherein the mixture of the stereo- or regioisomers is fixed on a 
support. 

18. The process according to claim 17, wherein one of the stereo- or 
regioisomers of the support-bound mixture of isomeric compounds is 
labeled with the radioisotope 3 H, and the other stereo- or regioisomer 
is labeled with the radioisotope 14 C. 

19. The process according to claim 1, wherein the test for stereoselectiv- 
ity in step d) is effected through the capillary-electrophoretic deter- 
mination of the reaction products and educts of a test reaction, the 
separation of the stereoisomeric reaction products and educts being 
performed in chirally modified capillaries. 

20. The process according to claims 13 to 19, wherein several reactions 
are performed in parallel in microtitration plates. 

21. The process according to claim 1, wherein the position of the codon 
coding for the changed amino acid is localized by sequencing in the 
mutants having improved properties identified in step d), followed by 
generating a set of hydrolase genes with all possible codons for this 
position by means of site-directed saturation mutagenesis, and the 
mutated hydrolase genes thus obtained are further treated in analogy 
with steps c) and d) of claim 1. 
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22. The process according to claim 21, wherein the localization of the 
position of the codon coding for the changed amino acid is effected 
through DNA sequencing. 

23. A hydrolase mutant obtainable by a process according to one or 
more of claims 1 to 22. 

24. The hydrolase mutant according to claim 23 which is a lipase 
mutant. 

25. The hydrolase mutant according to claim 23 which is an esterase 
mutant. 

26. The hydrolase mutant according to claim 24 which is a lipase 
mutant of the starting lipase from the strain P. aeruginosa. 

27. The hydrolase mutant according to claim 26 which is obtainable by 
expression from the transformants P1B 01-E4 (DSM 11 658), P2B 
08-H3 (DSM 11 659), P3B 13-D10 (DSM 11 660), P4B 04-H3 (DSM 
12 322), P5B 14-C11 (DSM 12 320) or P4BSF 03-G10 (DSM 12 
321). 

28. The hydrolase mutant according to claim 24 which has the amino 
acid sequence of the mature proteins shown in SEQ ID NOS. 4, 6, 8, 
12, 14, 16 or 18. 

29. A DNA sequence coding for a hydrolase mutant according to one or 
more of claims 23 to 28. 

30. The DNA sequence according to claim 29 which comprises a DNA 
sequence shown in SEQ ID NOS. 3, 5, 7, 11, 13, 15 or 17. 

31. A vector comprising a DNA sequence according to claim 29 or 30. 
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32. A transformant comprising a DNA sequence according to claim 29 or 
30 and/or a vector according to claim 31. 

33. The transformant according to claim 32 which is transformant P1B 
01-E4 (DSM 11 658), P2B 08-H3 (DSM 11 659), P3B 13-D10 (DSM 
11 660), P4B 04-H3 (DSM 12 322), P5B 14-C11 (DSM 12 320) or 
P4BSF 03-G10 (DSM 12 321). 

34. A process for the preparation of hydrolase mutants having improved 
properties, comprising culturing a transformant according to claim 
32 or 33. 

35. A method for testing catalysts for stereo- or regioselectivity, 
wherein equal amounts of the catalyst are added to a test substrate 
and to the pure stereo- or regioisomers of the test substrate, pro- 
vided with a chromophorous group which causes a spectrometrically 
determinable change of absorption or emission upon cleavage by 
the catalyst, in separate test vessels, and the stereo- or regioseiec- 
tivity is determined from the ratio of the linear initial reaction rates 
obtained. 
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Abstract 



The present invention relates to a process for the preparation and identifi- 
cation of hydrolase mutants having improved properties with respect to 
stereo- or regioselectivity, catalytic activity or stability in chemical reac- 
tions. 
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Wild type ggatcccc ggttctcccgg 19 

P1B OX-HX GGATCCCCCGGTTCTCCCGG 20 

P1B 0X-E4 GGATCCCCCGGTTCTCCCGG 20 

P2B 08-H3 GGATCCCCCGGTTCTCCCGG 20 

P3B X3-DXO GGATCCCCCGGTTCTCCCGG 20 

P4B 04-H3 GGATCCCCCGGTTCTCCCGG 20 

PSB X4-CX1 GGATCCCCCGGTTCTCCCGG 20 

P4BSP 03-GXO GGATCCCCCGGTTCTCCCGG 20 



-T ' " 1" 

20 30 



Wild tVDe AAGGATTCGGGCGATGGCTG 39 

piR m-Hi AAGGATTCGGGCGATGGCTG 40 

AAGGATTCGGGCGATGGCTG 40 

AAGGATTCGGGCGATGGCTG 40 

P3B X3-DX0 AAGGATTCGGGCGATGGCTG 40 

P4B 04-H3 AAGGATTCGGGCGATGGCTG 40 

PSB 14-C11 AAGGATTCGGGCGATGGCTG 40 

P4BSF 03-G10 AAGGATTCGGGCGATGGCTG 40 



40 50 
_i , . J- 



Wild type gcaggacgcgcccctcggcc 59 



GCAGGACGCGCCCCTCGGCC 60 
GCAGGACGCGCCCCTCGGCC 60 
GCAGGACGCGCCCCTCGGCC 60 



P3B X3-DX0 GCAGGACGCGCCCCTCGGCC 60 

P4B 04-H3 GCAGGACGCGCCCCTCGGCC 60 

PSB 14-C11 GCAGGACGCGCCCCTCGGCC 60 

P4BSF 03-GXO GCAGGACGCGCCCCTCGGCC 60 

i ■ 1 " 

60 70 

Wild tVDe CCATCAACCT GAGAT GAGAA 79 

dip nn ui CCATCAACCTGAGAT GAGAA 80 

pir i F4 CCATCAACCTGAGATGAGAA 80 

Mtt or S3 CCATCAACCTGAGATGAGAA 60 

P3B X3-DX0 CCATCAACCTGAGATGAGAA 80 

P4B 04-H3 CCATCAACCTGAGATGAGAA 80 

PSB X4-CXX CCATCAACCTGAGATGAGAA 80 

P4BSF 03-GXO CCATCAACCTGAGATGAGAA 80 

i — 1 — 

80 90 



WilH tvnn CAACATGAAGAAGAAGTCTC. 99 

™" n^Ei CAACATGAAGAAGAAG T0T C 100 

C A A C A T G A A G A A G A A G T C T C 100 

CAACATGAAGAAGAAGTCTC 100 

CAACATGAAGAAGAAGTCTC 100 

CAACATGAAGAAGAAGTCTC 100 

CAACATGAAGAAGAAGTCTC 100 

P4BSF 03-GXO CAACATGAAGAAGAAGTCTC 100 



100 110 



Wild tVDfi TGCTCCCCCTCGGCC.TGGCC 119 

^ _ _ _ _ **^^^-rr*<~^r>rsTr*r±r*r* ton 
PXB OX-HX 
PXB 0X-E4 
P2B 08-H3 
P3B X3-DX0 

P4B 04-H3 TGCTCCCCCTCGGCCTGGCC 120 

PSB X4-CXX TGCTCCCCCTCGGCCTGGCC 120 

P4BSF 03-GXO TGCTCCCCCTCGGCCTGGCC 120 



TGCTCCCCCTCGGCCTGGCC 120 

TGCTCCCCCTCGGCCTGGCC 120 

TGCTCCCCCTCGGCCTGGCC 120 

TGCTCCCCCTCGGCCTGGCC 120 
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120 



130 



Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-C11 
P4BSF 03-G10 



Wild type 

P1B 01-H1 

P1B 01- E4 

P2B 08-H3 

P3B 13-D10 

P4B 04-H3 

P5B 14-C11 
P4BSF 03-G10 



Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-113 
P5B 14-C11 
P4BSF 03-G10 



Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14 -CXI 
P4BSF 03-G10 



Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
PSB 14-C11 
P4BSF 03-G10 



Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 0 8-H3 
P3B 13-D10 
P4B 04-H3 
PSB 14-C11 
P4BSF 03-G10 



G 
G 
G 
G 
G 
G 
G 
G 



140 



150 



G 
G 
G 
G 
G 
G 
G 
G 



C T 
O T 



G A 
G A 



G 
G 
G 
G 
G 
G 



T C C A G G 
T C C A G 
C 
C 

c 
c 
c 
c 



c 
c 
c 
c 
c 
c 



160 



170 



c 
c 
c 
c 
c 
c 
c 
c 



180 



190 



T C G 
T C G 
C 
C 
C 

c 
c 
c 



139 
140 
140 
140 
140 
140 
140 
140 



159 
160 
160 
160 
160 
160 
160 
160 



179 
180 
180 
180 
180 
180 
180 
180 



199 
200 
200 
200 
200 
200 
200 
200 



200 



210 



G 
G 
G 
G 
G 
G 
G 
G 



T 
T 
T 
T 
T 
T 
T 
T 



— i — 
220 



230 



C 

c 
c 
c 
c 
c 
c 
c 



219 
220 
220 
220 
220 
220 
220 
220 



239 
240 
240 
240 
240 
240 
240 
240 
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Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-Cll 
P4BSF Q3-G10 



240 
t 

T 
T 
T 
T 
T 
T 
T 



C 
C 
C 
C 
C 
C 

c 



250 
. i 
G 
G 
G 
G 
G 
G 
G 



C A 

C A 

C A 

C A 

C A 



G 
G 
G 
G 
G 

C A [A] 260 
C A G 260 



259 
260 
260 
260 
260 



TACTGGTTCGGCATTCCCAG 260 



Wild type 

P1B 01-H1 

P1B 01-E4 

P2B 08-H3 

P3B 13-D10 

P4B 04-H3 

P5B 14-C11 
P4BSF 03^G10 



260 



270 



C G C C T T 
C G C C T T 
C G C C T T 



G C G C C ,G 
G C G C C G 
G C G C C G 



TGACGGTG 279 
TGACGGTG 280 
TGACGGTG 280 



CGCCTTGC G-C CGTGACGGTG 280 

CGCCTTGCGCCGTGACGGTG 280 

CGCCTTGCGCCGTGACGGTG 280 

CGCCTTGCGCCGTGACGGTG 280 

CGCCTTGCGCCGTGACGGTG 280 



280 



290 



Wild type 

P1B 01 -HI 

P1B 01-E4 

P2B 08 -H3 

P3B 13-D10 

P4B 04-H3 

PSB 14-C11 
P4BSF 03-G10 



299 
300 
300 
300 
300 
300 



CCCAGGTCTACGTCACCGAA 300 
CCCAGGTCTACGTCACCGAA 300 



300 



— , — 
310 



Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
PSB 14-C11 
P4BSF 03-G10 



319 
320 
320 
320 
320 
320 
320 
320 



320~ 



Wild type 

P1B Ol-Hl 
P1B 01-E4 
P2B 08-H3 
P3B 13 -mo 
P4B 04-H3 
PSB 14-C11 
P4BSF 03-G10 



C- G 
C G 



330 

JL. 



G A 
G A 



339 
340 
340 
340 
340 
340 
340 
340 



340 



350 



Wild type 

P1B Ol-Hl 

P1B 01-E4 

P2B 08-H3 

P3B 13-D10 

P4B 04-H3 

PSB 14-C11 
P4BSF 03-G10 



A 
A 
A 
A 
A 
A 
A 
A 



359 
360 
360 
360 
360 
360 
360 
360 
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360 



370 



Wild tVOe ATCG TCGCCCTCAG-CG G C C A 379 

IVl n Si ATCGTCGCCCTCAGCGGCCA 380 

lit ol'zl ATCGTCGCCCTCAGCGGCCA 380 

P2B 08-H3 ATCGTCGCCCTCAGCGGCCA 380 

P3B 13-D10 ATCGTCGCCCTCAGCGGCCA 380 

P4B 04-H3 A T C G T C G C C A 6 C G 380 

GTCGCCCTCAGCGGCCA 380 

TCGCCCTCAGCGGCCA 380 



P5B 14-C11 A T C 

P4BSF 03-G10 A T C G 



"~i — 
380 



— I 
390 



wiw type tTTTTrriTTITTTTTI JSSS 
- Si=5 1 ! g S J J S S ? S i i S S ? S i ? S S S 

?f"nnn GCCCAAGGTCAACCTGATCG 400 

SS G C C C ^ A S G T C A AC C T G AT C G 400 

CAAGGTCAACCTGATCCa 40U 

CAAGGTCAACCTGATCG 400 



P5B 14-C11 G C C 

P4BSF 03-G10 G C C 



400 



410 



»*/m-ix " ^rrACAGC CACGGCGGGCCG 419 

Wild type GCCACAGCCACGGCGGGCCG 420 

P1B 01-H1 rcCACAGCCACGGCGGGCCG 420 

P1B 01-E4 RCCACAGCCACGGCGGGCCG 420 

P2B 08-H3 rrCACAGCCACGGCGGGCCG 420 

52 EE 0 «c^^ s ?h?§ggSSS?§2S 

P5B 14-C11 GCCACAGCCACGGCGGGCCG 420 

P4BSF 03-G10 GCCACAGCCACGGCGGGCCG 420 



420 



430 



wiw type TrnTTTTfTTTTTTSTTT! S 

P1B 01-H1 ACCATCCGCTACGTCGCCGC 440 

P1B 01-E4 ACCATCCGCTACGTCGCCGC 440 

P2B 08-H3 ACCATCCGCTACGTCGCCGC 440 

P3B 13-D10 ArrATCCGCTACGTCGCCGC 440 

P4B 04-H3 ACCA ^^^-I-VoI-T<~r-rCGC 440 

psb 14-cii ACCATCCG Sl A o?^rrrcGC 440 

P4BSF 03-610 ACCATCCGCTACGTCGCCGC 440 



440 



— i — 
450 



wiw type tTTTTITTITlTTrriTTTT S 

S S^S CGTACGTCCCGACCTQATCG 460 

P2B 08-H3 rGTACGTCCCGACCTGATCG 460 

ill aSi* f S i | % T = CO G J O O T 3 J TO . - 

P4BSF 03-G10 CGTACGTCCCGACCTQATCG 460 



-*=: r 



460 



470 _ 

CCAGCGT C G G <J 479 



Wilrl t\/n*i C T T C C G C CACCAGCt.1 o^uv. y~> 

Wild type cTTCCGCCA [T] CAGCGT CGGC 480 

P1B 01-H1 CTTCCGCCACCAGCGTCGGC 480 

P1B 01-E4 St^CCGCCACCAGCGTCGGC 480 

III ITllo CTTCCGCCACCAGCGTCGGC 480 

P4B 04IH3 0 CTTCCGCCACCAGCGTCGGC 48 

TTCCGCCACCAGCGTCGGC 480 



PSB 14-C11 Ci . ^, ~ ~ ~ - * * ^ r* A*n 

P4BSF 03-G10 CT TCCGCCACCAGCGTCGGC 480 



+ 
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480 490 

Wild type gccccgcacaagggttcgga 499 

P1B 01-Hl GCCCCGCACAAGGGTTCGGA 500 

P1B 01-E4 GCCCCGCACAAGGGTTCGGA 500 

P2B 08-H3 GCCCCGCACAAGGGTTCGGA 500 

P3B 13-D10 GCCCCGCACAAGGGTTCGGA 500 

P4B 04-H3 GCCCCGCACAAGGGTTCGGA 500 

P5B 14-C11 QCCCCGCACA[G]GGGTTCGGA 500 

P4BSF 03-G10 GCCCCGCACAAGGGTTCGGA 500 



P1B 01-Hl 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 



— ! ~ 1 

500 510 



Wild tVDe CACCGCCGACT ( TCCT G C G C C 519 

pVr ni-Hi CACCGCCGACTTCCTGCGCC 520 

CACCGCCGACTTCCT GCGCC 520 

CACCGCCGACTTCCTGCGCC 520 

CACCGCCGACTTCCTGCGCC 520 

P4B 04-H3 CACCGCCGACTTCCTGCGCC 520 

P5B 14-C11 CACCGCCGACTTCCTGCGCC 520 

P4BSF 03-G10 CACCGCCGACTTCCTGCGCC 520 



520 530 



Wild tVDe AGATCCCACCGGGT T C G G C C 539 

pib 01-Hl AGATCCCACCGGGTT CGGCC 540 

III 01-E4 AGATCCCACCGGGTTCGGCC 540 

PXB01 AGATCCCACCGGGTTCGGCC 540 

AGATCCCACCGGGTTCGGCC 540 

AGATCCCACCGGGTTCGGCC 540 

PSB 14-C11 AGATCCCACCGGGTTCGGCC 540 

P4BSF 03-G10 AGATCCCACCGGGTTCGGCC 540 



P2B 08-H3 
P3B 13-D10 
P4B 04-H3 A G A 



-I ~ " 1 — 

540 550 



GGCGAGGCAGTCCTCTCCGG 559 

Wild tVDe GGCGAGGCAGTCCTCTCCGG 560 

pib 01-E4 GGCGAGGCAGTCCTCTCCGG 560 

P2B 08-H3 GGCGAGGCAGTCCTCTCCGG 560 

lit S3-D10 GGCGAGGCAGTCCTCTCCGG 560 

P3B 13 Diu GGCGAGGCAGTCCTCTCCGG 560 

GGCGAGGCAGTCCTCTCCGG 560 



P4B 04-H3 

PSB 14-C11 « - - - _ m 

P4BSF 03-G10 GGCGAGGCAGTCCTCTCCGG 560 



— j — 1 — 

560 570 



WiiH tvnP GCTGGTCAACAGCCTCGGC G 579 

Sm S 5f GCT GGT CAA CA'GCCT C GGCG 580 

III ol'll GCTGGTCAACAGCCTCGGCG 580 

III ol-lt GCTGGTCAACAGCCTCGGCG 580 

III 13-D10 GCTGGTCAACAGCCTCGGCG 580 

III 04-H3 GCTGGTCAACAGCCTCGGCG 580 

PSB 14-C11 GCTGGTCAACAGCCTCGGCG 580 

P4BSF 03-G10 GCTGGTCAACAGCCTCGGCG 580 



-r 



580 " 590 



WildtVDe CGCTGATCAGCTTCCTTTCC 599 

PiR 01 HI CGCTGATCAGCTTCCTTTCC 600 

III 01-E4 CGCTGATCAGCTTCCTTTCC 600 

III 08-H3 C GCTGATCA °^:^ TT TCC Z 

P3B 13-D10 CGCTGATCAGCTTCCTTTCC 600 

P4B 04-H3 CGCTGATCAGCTTCCTTTCC 600 

PSB 14-C11 CGCTGATCAGCTTCCTTTCC 600 

P4BSF 03-G10 CGCTGATCAGCTTCCTTTCC 600 



+ 
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600 



Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14 -CXI 
P4BSF 03-G10 



Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-C11 
P4BSF 03-G10 



Wild type 

P1B 01 -HI 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-C11 
P4BSF 03-G10 



Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-C11 
P4BSF 03-G10 



Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-C11 
P4BSF 03-G10 



Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-C11 
P4BSF O3-G10 



A 
A 
A 
A 
A 
A 
A 
A 



C (_AJ G 
C G G 
G 
G 
G 
G 
G 
G 



610 

i 

C 
C 

c 
c 
c 
c 
c 



A[T]C 



c 
c 
c 
c 
c 
c 
c 
c 



I — 
620 



630 



G 
G 
G 
G 
G 
G 
G 
G 



T 
T 
T 

T|T]c 



G 
G 
G 
G 

"_G 
G 
G 
G 



G 
G 
G 
G 
G 
G 
G 
G 



619 
620 
620 
620 
620 
620 
620 
620 



639 
640 
640 
640 
640 
640 
640 
640 



— i — 

640 



650 



C 
C 

c 
c 
c 
c 
c 
c 



C G A G G G T 
C G A G G G T 
C G A G G G T 
C G A G G G T 
C G A G G G T 
CGAGGGT 
C A G CD G A G G G T 
CAGCGAGGGT 



C A 

C A 

C A 

C A 

C A 

C A 



G 
G 
G 
G 
G 
G 



660 



670 



G 
G 
G 
G 
G 
G 
G 
G 



C 

c 
c 
c 
c 
c 
c 
c 



669 
660 
660 
660 
660 
660 
660 
660 



679 
680 
680 
680 
680 
680 
680 
680 



680 



690 



G 
G 
G 
G 
G 
G 
G 
G 



G G G C A T 
G G G C A T 



C C 
C C 



699 
700 
700 
700 
700 
700 
700 
700 



700 



— i — 
710 



C 
C 

c 
c 
c 
c 
c 
c 



c 
c 
c 
c 
c 
c 
c 
c 



719 
720 
720 
720 
720 
720 
720 
720 
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720 



730 



Wild type 

P1B Ol-Hl 

P1B 01-E4 

P2B 08-H3 

P3B 13-D10 

P4B 04-H3 

P5B 14-C11 
P4BSF 03-G10 



G 


C 


c 


T 


A 


C 


A 


A 


G 


G 


T 


C 


A 


A 


C 


G 


G 


C 


G 


T 


739 


G 


C 


c 


T 


A 


C 


A 


A 


G 


G 


T 


C 


A 


A 


C 


G 


G 


C 


G 


T 


740 


G 


C 


c 


T 


A 


c 


A 


A 


G 


G 


T 


c 


A 


A 


C 


G 


G 


C 


G 


T 


740 


G 


C 


c 


T 


A 


c 


A 


A 


G 


G 


T 


c 


A 


A 


C 


G 


G 


C 


G 


T 


740 


G 


C 


c 


T 


A 


c 


A 


A 


G 


G 


T 


c 


A 


A 


c 


G 


G 


C 


G 


T 


740 


G 


c 


T 


T 


A 


c 


A 


A 


G 


G 


T 


c 


A 


A 


c 


G 


G 


C 


G 


T 


740 


G 


c 


T 


T 


A 


c 


A 


A 


G 


G 


T 


c 


A 


A 


c 


G 


G 


C 


G 


T 


740 


G 


c 


c 


T 


A 


c 


A 


A 


G 


G 


T 


c 


A 


A 


c 


G 


G 


C 


G 


T 


740 



740 



— I — 
750 



Wild type 

P1B Ol-Hl 
P1B 01-E4 
P2B 0 8-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-C11 
P4BSF 03-G10 



G 
G 
G 
G 
G 
G 
G 
G 



T 
T 
T 
T 
T 
T 
T 
T 



C 
C 

c 
c 
c 
c 
c 
c 



759 
760 
760 
760 
760 
760 
760 
760 



760 



770 



Wild type 

P1B Ol-Hl 

P1B 01-E4 

P2B 08-H3 

P3B 13-D10 

P4B 04-H3 

P5B 14-C11 
P4BSF 03-G10 



Wild type 

P1B Ol-Hl 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-C11 
P4BSF 03-G10 



Wild type 

P1B Ol-Hl 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-C11 
P4BSF 03-G10 



Wild type 

P1B Ol-Hl 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
PSB 14-C11 
P4BSF 03-G10 



G 
G 
G 
G 
G 
G 
G 
G 



C 
C 

c 
c 
c 
c 
c 
c 



J — 
780 



790 



T 
T 
T 
T 
T 
T 
T 
T 



C 
C 

c 
c 
c 
c 
c 
c 



800 



810 



G C 
G * C 
G C 



C 
C 
C 

c 
c 
c 
c 
c 



820 



830 



T 
T 
T 
T 
T 
T 
T 
T 



G 
G 
G 
G 
G 
G 
G 
G 



779 
780 
780 
780 
780 
780 
780 
780 



799 
800 
800 
800 
800 
800 
800 
800 



819 
820 
820 
820 
820 
820 
820 
820 



839 
840 
840 
840 
840 
840 
840 
840 
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840 



850 



Wild type 

P1B 01-H1 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-C11 
P4BSF 03-G10 



G C 
G C 



G 
G 
G 
G 
G 
G 



G G 
G G 



859 
860 
860 
860 
860 
860 
860 
860 



— 1 — 

860 



870 



Wild type 

P1B Ol-Hl 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-C11 
P4BSF 03-G10 



C 

c 
c 
c 
c 
c 
c 
c 



c 
c 
c 
c 
c 
c 
c 
c 



G T 
G T 
G T 
G T 
G 'T 
G T 
G T 
G T 



C 
C 

c 
c 
c 
c 
c 
c 



879 
880 
880 
880 
880 
880 
880 
880 



880 



890 



Wild type 

P1B Ol-Hl 

P1B 01-E4 

P2B 08-H3 

P3B 13-D10 

P4B 04-H3 

P5B 14-C11 
P4BSF 03 -GIG 



G 
G 
G 
G 
G 
G 
G 
G 



C 
C 
C 
C 
C 
C 
C 

c 



c 
c 
c 
c 
c 
c 
c 
c 



899 
900 
900 
900 
900 
900 
900 
900 



900 



910 



Wild type 

P1B Ol-Hl 

P1B 01-E4 

P2B 08-H3 

P3B 13-D10 

P4B 04-H3 

P5B 14-C11 
P4BSF 03-G10 



Wild type 

P1B Ol-Hl 
P1B 01-E4 
P2B 08-H3 
P3B 13-D10 
P4B 04-H3 
P5B 14-C11 
P4BSF 03-G10 



Wild type 

P1B Ol-Hl 
P1B 01-E4 
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I 

# 09/463494 



13/13 



960 



Wildt VP e TlciicccGO? CAGCGTCT 

P p^ Si'-Si accagcccggtcagcgtct 

P2B 08-H3 ACCAGCCCG G 

P3B 13-D10 ACCAGCCCGGTCAGCGTCT 

P4B 04-H3 ACCAGCCC6G CAG G 

P5B 14-C11 A C C A G ° ° ^ ^ ^ r C G C A 980 

P4BSF 03-G10 ACCAGCCCGGTCAGCGTCTA 980 



970 

GGTCAGCGTCTA 979 

- - - ~ A 980 

A 980 

A 980 

A 980 

A 980 



—i — 

980 



— , — 
990 



un , , ' 1 ^ ^ r r A G C A C G*C CAA C C G C C 999 

Wildtype §^^AGCACGCCAACCGCC 1000 

P1B Ol-Hl rcGCCAGCACGCCAACCGCC 1000 

P1B 01-E4 rCGCCAGCACGCCAACCGCC 1000 

P2B 08-H3 rrrCCAGCA-CGCCAACCGCC 1000 

P3B 13-D10 oricCAGCACGCCAACCGCC 1000 

III Itlll gcGCCAGCAclcCAACCGCC 1000 

mLf O^GIO CCGCCAGCACGCCAAGCGCC 1000 



1000 



1010 

GCCAGCCTGTAG 1019 

"""AG 1020 

A G 1020 

A G 1020 



Wild type TGAAGAA S?^ArccTGT 

Jr t GAAGAACGCCAGCCTCal 

P1B 01-H1 TGAAGAACGCCAGCCTGT 

P1B 01 ' E4 ^§AAGAACGCCAGCCTGT 

P2B 08-H3 ^GAAGAACGCCAGCCTGTAG 1020 

P3B 13-D10 X /^aa^aaPGCCAGCCT GTAG 1020 

P4B 04-H3 x^AAGAACGCCAGCCTGTAG 1020 

P5B 14-C11 ISaa^aaCGCCAGCCTGTAG 1020 

P4BSF 03-G10 TGAAGAACGCUAV3^^ 



1020 



1030 



P2B 08-H3 ^ACCCCGGCCGGGGCCTCGG 1040 

P3B 13-D10 rACCCCGGCCGGGGCCTCGG 1040 

P4B 04-H3 ^ArPCCGGCCGGGGCCTCGG 1040 

pIbS^-^O GACCCCGGCCGGGGCCTCGG 1040 



1040 



^ 1047 

Wildtype gggSSSggc io« 

P2B 08-H3 ^ C G G G C C ^ 

P3B 13-DlO rrrrGGCCCrol 1050 

P4B 04-H3 CCCGGGCC C [CSJ ^ 

P5B 14-CX1 CCCGGGCCC ^ 

P4BSF O3-G10 CCCGGGCCO 



r; 
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